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Description 



The present invention relates to DNA molecules responsible for encoding the multi-functional proteins that direct 
the biosynthesis of polyketide compounds, the products encoded by said DNA molecules, recombinant DNA expression 
5 vectors, and transformed microbial host cells. 

Polyketides are a family of compounds that include a large number of structurally and functionally diverse natural 
products. For example, the polyketides provide the structural backbone for compounds that exhibit a variety of biological 
activities, such as, antibiotic, antitumor, and immunosuppressive agents. 

Although the polyketides are quite diverse as indicated, they share a common mechanistic scheme of biosynthesis. 
10 The polyketides are synthesized by the successive condensation of small carboxylic acid residues followed by variable 
reduction steps at the resulting p-keto carbon (i.e., p-carbonyl) moiety in a process that is similar to the synthesis of 
fatty acids. The iterative synthetic process for many of these polyketides is controlled by a complex of large, multi- 
functional polypeptides that have distinct sites for the variety of activities that are required. 

The genera! scheme for polyketide biosynthesis has been reviewed, for example, in Hopwood and Sherman, Annu. 
'5 Rev. Genet, 1990, 24:37 and Katz and Donadio, Annu. Rev. Microbiol., 1993, 47:875. 

Naturally-occurring DNA sequences that encode the polyketide synthase enzymes have generally been found to 
be organized into repeated subunits, or modules, each of which encodes all the activities required in a single round of 
synthesis, which includes the condensation step itself and the post-condensation processing steps. Each activity is 
associated with a distinct site, which contributes to the specificity for the particular carboxylic acid building block that 
20 is incorporated at each condensation step, or which dictates the particular post-condensation processing functions 
that will be executed. 

For example, PCT publication WO 93/13663 describes the organization of the gene encoding the polyketide syn- 
thase of Saccharopolyspora erythraea. The gene is organized in modules, with each module effecting one condensation 
step. The precise sequence of chain growth and the processing of the growing chain is determined by the genetic 
25 information in each module. This PCT application describes an approach for synthesizing novel polyketide structures 
by manipulating in several ways the DNA governing the biosynthesis of the cyclic lactone framework. In order to adapt 
this methodology to other polyketides, however, the DNA molecules directing the biosynthetic processing must first be 
isolated. 

The present invention is directed to the DNA sequence for the gene cluster responsible for encoding tylactone 
30 synthase, the building machinery of tylactone, which is the basic building block of tylosin. As a result, the present 
invention enables modification of the DNA sequence so as to alter the type of carboxylic acids incorporated, the number 
of carboxylic acids incorporated, the post-condensation reactions performed, or any combination thereof, thereby re- 
sulting in novel tylosin -related polyketides. 

The present invention provides a DNA molecule comprising an isolated DNA sequence that encodes a tylactone 
35 synthase domain. Thus, the present invention provides the DNA molecule of SEQ ID NO:1 and DNA molecules that 
contain submodules thereof. The present invention also provides the products encoded by said DNA molecules, re- 
combinant DNA expression vectors, and transformed microbial host cells. 

Figure 1 is a map of the tylactone polyketide synthase region (tylG) of the Streptomyces fradiae DNA (-45 kb). 
Distances in kb are relative to the beginning of tylG. Open reading frames (ORFs) are indicated by arrows. Restriction 
*o sites are denoted as follows: 



E - EcoRI 
B - BamH\ 

45 Predicted functional domains are labeled as follows: 



ACP - acyl carrier protein 

AT - acyltransferase 

DH - dehydratase 

50 ER - enoy I reductase 

KR - ketoreductase 

KR' - ketoreductase-like domain predicted to be inactive 

KS - ketosynthase 

KS' - ketosynthase-like domain in which a glutamine residue is present in the position occupied by an active site 

55 cysteine in a normal ketosynthase 

TE - thioesterase. 



Figure 2 depicts the biosynthetic pathway for tylactone synthesis. 
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Figure 3 is a map of the two clones that span the whole region of the tylG DNA. 

Figure 4 is a map of the srmG region of the Streptomyces ambofaciens DNA. Distances in kb are relative to the 
beginning of srmG. Open reading frames are indicated by arrows. The srmG DNA (0-42 kb) is the platenolide polyketide 
synthase region. Restriction sites are denoted as above with additions as follows: 

5 

AP - Apa\ 
G - Bgh\ 
K - Kpn\ 
P- Psti 
10 X - Xho\ 

Predicted functional domains are labeled as above. 

Figure 5 demonstrates the biosynthetic pathway for platenolide synthesis. 
Figure 6 is a map of the two clones that span the whole region of the srmG DNA. 
is Polyketides are a large class of structurally and functionally diverse natural products. A common feature of com- 

pounds in this class is that their synthesis is accomplished under the direction of a complex of multi-functional peptides, 
termed a "polyketide synthase." Molecular genetic analysis of polyketide synthase genes has revealed two distinct 
classes of enzymes operating for different polyketides: 1 ) the aromatics, which are made through an essentially iterative 
process, and 2) the complex polyketides, which comprise several repeats of the same activities arranged in few very 
20 large polypeptides. 

Among the complex polyketide synthase genes, a polyketide synthase includes enzymatic and regulatory activities 
responsible for exercising substrate specificity, catalyzing the condensation of small carboxylic acid building blocks (in 
the form of coenzyme A thioesters) to a growing polyketide carbon chain, and catalyzing the post-condensation 
processing reactions at the p-carbonyl functional group that results from the condensation reaction. 

25 The condensation reaction requires several activities including acyl carrier protein (ACP), p-ketosynthase (KS), 

and acyltransferase (AT). Once a condensation has occurred, the resulting p-carbonyl functional group may be mod- 
ified. Post-condensation activities that may be involved include p-ketoreductase (KR), dehydratase (DH), and enoylre- 
ductase (ER). Polyketide biosynthesis is terminated by a thioesterase (TE) activity. Whether all, some, or none of these 
activities act after a particular condensation step determines the structure of the final product. 

30 The present invention provides, in particular, the DNA sequence encoding the polyketide synthase responsible for 

biosynthesis of ty lactone, i.e., ty lactone synthase. Tylactone itself is the polyketide backbone of the commercially sig- 
nificant antibiotic tylosin. The tylactone synthase DNA sequence, which defines the tylactone synthase gene cluster, 
directs biosynthesis of the tylactone polyketide by encoding the various distinct activities of tylactone synthase. 

The gene cluster for tylactone synthase, like other complex polyketide biosynthetic genes whose organization has 

35 been elucidated, is characterized by the presence of several open reading frames (ORFs), each of which contains one 
or more repeated units termed "modules." A module is defined as the genetic element encoding all of the distinct 
activities required in a single round of synthesis, i.e., one condensation step and all the p-carbonyl processing steps 
associated therewith. Each module encodes an ACP, a KS, and an AT activity to accomplish the condensation portion 
of the synthesis, and selected post-condensation activities to effect p-carbonyl processing. The polypeptides encoded 

40 by such modules are termed "synthase units" (SUs). 

Each module is further characterized by the inclusion of submodules that are responsible for encoding the distinct 
activities of tylactone synthase. For purposes of the present invention, the term "submodule" is defined as the portion 
of the polyketide synthase DNA sequence that encodes a distinct activity, or "domain". Thus, a domain is taken as 
commonly understood to mean that part of the polyketide synthase polypeptide necessary for a given distinct activity. 

45 Organization of the tylactone synthase gene cluster derived from Streptomyces fradiae, is shown in Figure 1 . The 

tylactone synthetic pathway, with indications of the specific carboxylic acid substrates used for each condensation 
reaction and the various post-condensation activities, is shown in Figure 2. 

A preferred DNA molecule comprising the tylactone synthase gene cluster isolated from Streptomyces fradiae is 
represented by SEQ I D NO: 1 . Other preferred DNA molecules of the present invention include the various open reading 

50 frames of SEQ ID NO:1 that encode individual multi-functional polypeptides. These are represented by the following 
nucleotide residues of SEQ ID NO:1: ORF1 81 6 to 14234, ORF2 14351 to 19945, ORF3 20010 to 31199, ORF4 31232 
to 36067, and ORF5 36249 to 41774. The predicted amino acid sequences of the various peptides encoded by these 
sequences are shown in SEQ ID NO:2, 3, 4, 5, and 6, respectively. 

Yet other preferred DNA molecules of the present invention include the modules that encode the synthase units, 

55 which include all the activities necessary for a single round of synthesis. These are represented by the following nu- 
cleotide residues of SEQ ID NO:1: Starter Module 942 to 3929, Module 1 3993 to 8471, Module 2 8541 to 13970, 
Module 3 1 441 1 to 1 9666, Module 4 201 36 to 2461 1 , Module 5 24675 to 30902, Module 6 31 337 to 35743, and Module 
7 36360 to 40826. The predicted amino acid sequences of the various synthase units encoded by these modules are 



3 



EP 0 791 655 A2 



represented, respectively, by the following amino acid residues: Starter SU 43 to 1038, SU1 1060 to 2552 and SU2 
2576 to 4385 in SEQ ID NO:2; SU3 21 to 1772 in SEQ ID NO:3; SU4 43 to 1534 and SU5 1556 to 3631 in SEQ ID 
NO: 4; SU6 36 to 1504 in SEQ ID NO:5; and SU7 38 to 1526 in SEQ ID NO:6. 

Still other preferred DNA molecules include the various submodules that encode the various domains of tylactone 

s synthase. These submodules are represented by the following nucleotide residues: KS Q (s) 942 to 2156, AT(s) 2571 
to 3557, ACP(s) 3675 to 3929, KS1 3993 to 5264, AT1 5631 to 6617, KR1 7410 to 7949, and ACP1 8220 to 8471 of 
Module 1 in SEQ ID NO:1; KS2 8541 to 9812, AT2 10260 to 11246, DH2 11319 to 11876, KR2 12861 to 13415, and 
ACP2 13719 to 13970 of Module 2 in SEQ ID NO:1; KS3 14411 to 15697, AT3 16055 to 17122, DH3 17198 to 17794, 
KR3 18584 to 19138, and ACP3 19415 to 19666 of Module 3 in SEQ ID NO:1; KS4 20136 to 21404, AT4 21771 to 

10 22757, KR'4 23541 to 24077, and ACP4 24360 to 24611 of Module 4 in SEQ ID NO:1 ; KS5 24675 to 25949, AT5 26292 
to 27284, DH5 27360 to 27917, ER5 28767 to 29813, KR5 29829 to 30368, and ACP5 30651 to 30902 of Module 5 
in SEQ ID NO: 1 ; KS6 31 337 to 32608, AT6 32975 to 33961 , KR6 34694 to 35236, and ACP6 35492 to 35743 of Module 
6 in SEQ ID NO:1 ; KS7 36360 to 37631 , AT7 37989 to 38987, KR7 39759 to 40313, ACP7 40575 to 40826, and TE7 
41235 to 41 333 of Module 7 in SEQ ID NO:1 . 

is The predicted amino acid sequences of the various domains encoded by these submodules are represented, 

respectively, by KS Q (s) 43 to 447, AT(s) 586 to 914, ACP(s) 954 to 1038, KS1 1060 to 1483, AT1 1606 to 1 934, KR1 
2199 to 2378, and ACP1 2469 to 2552 in SEQ ID NO:2; KS2 2576 to 2999, AT2 3149 to 3477, DH2 3502 to 3687, 
KR2 4016 to 4200, and ACP2 4302 to 4385 in SEQ ID NO:2; KS3 21 to 449, AT3 569 to 924, DH3 950 to 1148, KR3 
1 41 2 to 1 596, and ACP3 1 689 to 1 772 in SEQ I D NO: 3; KS4 43 to 465, AT4 588 to 91 6, KR'4 1 1 78 to 1 356, and ACP4 

20 1 451 to 1534 in SEQ ID NO:4; KS5 1556 to 1980, AT5 2095 to 2425, DH5 2451 to 2636, KR5 3274 to 3453, and ACP5 
3548 to 3631 in SEQ ID NO:4; KS6 36 to 459, AT6 582 to 910, KR6 1155 to 1335, and ACP6 1421 to 1504 in SEQ ID 
NO:5; KS7 38 to 461, AT7 581 to 913, KR7 1171 to 1355, ACP7 1443 to 1526, andTE7 1663 to 1695 in SEQ ID NO:6. 

Although not wishing to be bound to any particular technical explanation, sequence similarity exists among domain 
boundaries in various polyketide synthase genes. Thus, one skilled in the art is able to predict the domain boundaries 

25 of newly discovered polyketide synthase genes based on the sequence information of known polyketide synthase 
genes. In particular, the boundaries of submodules, domains, and open reading frames in the instant application are 
predicted based on sequence information disclosed in the instant application and the locations of the domain bound- 
aries of the erythromycin polyketide synthase (Donadio et al., Gene 111:51 (1992)). 

The DNA sequence of the tylactone synthase gene was determined from recombinant DNA clones prepared from 

30 the DNA of Streptomyces fradiae. The tylactone synthase gene is contained in recombinant DNA vectors pSET506 
and pSET507 (Figure 3), which are available from the National Center for Agricultural Utilization Research, 1 81 5 North 
University Street, Peoria, Illinois 61 604-3999, in E coli K1 2 MM294 under accession numbers NRRL B-18688 (depos- 
ited July 19, 1990) and NRRL B-18689 (deposited July 19, 1990), respectively. 

Tehcniques for isolating bacterial DNA are readily available and well known in the art. Any such technique can be 

35 employed in this invention. In particular, DNA from these cultures is isolated as follows. Lyophils of E coli K12 
MM294/pSET506 or E. coli K1 2 MM294/pSET507 are plated onto L-agar (10 g tryptone, 10 g NaCI, 5 g yeast extract, 
and 1 5 g agar per liter) plates containing 100 u,g/ml ampiciliin to obtain a single colony isolate of the strain. This colony 
is used to inoculate about 500 ml of L-broth (10 g tryptone, 10 g NaCI, 5 g yeast extract per liter) containing 100 jig/ 
ml ampiciliin, and the resulting culture is incubated at 37° C with aeration until the cells reach stationary phase. Cosmid 

40 DNA is obtained from the cells in accordance with procedures known in the art (see e.g., Rao et al, 1 987 in Methods 
in Enzymology, 153:166). 

DNA of the current invention can be sequenced using any known techniques in the art such as the dideoxynucle- 
otide chain-termination method (Sanger, et al., Proc. Natl. Acad. Sci. 74:5463 (1977)) with either radioisotopic or flu- 
orescent labels. Double-stranded, supercoiled DNA was used directly for templates in sequence reactions with se- 

45 quence-specific oligonucleotide primers. Alternatively, fragments were used to prepare libraries of either random, over- 
lapping sequences in the bacteriophage M13 or nested, overlapping deletions in a plasmid vector. Individual recom- 
binant DNA subclones were then sequenced with vector-specific oligonucleotide primers. Radioactive reaction products 
were elect rophoresed on denaturing polyacrylamide gels and analyzed by autoradiography. Fluorescent- labeled re- 
action products were electrophoresed and analyzed on Applied Biosystems (ABI Division, Perkin Elmer, Foster City, 

50 CA 94404) model 370A and 373A or DuPont (Wilmington, DE) Genesis DNA sequencers. Sequence data was assem- 
bled and edited using Genetic Center Group (GCG, Madison, Wl) programs GelAssemble and Seqed or the ABI model 
670 Inherit Sequence Analysis system and the AutoAssembler and SeqEd programs. 

Polypeptides corresponding to a domain, a synthase unit, or a multi-functional polypeptide can be produced by 
expression of the cDNA sequence in a bacteria, for example, using known expression vectors. Alternatively, the 

55 polypeptides mentioned above can be extracted from tylactone-producing bacteria such as Streptomyces fradiae. In 
addition, the techniques of synthetic chemistry can be employed to synthesize the polypeptides mentioned above. The 
procedures and techniques for isolation and purification of homogenous protein or polypeptides are well known in the 
art. 



4 



EP 0 791 655 A2 



Since the genetic organization of the tylactone synthase gene cluster appears to correspond to the order of the 
reactions required to complete synthesis of tylactone, knowledge of the tylactone synthase DNA sequence, its genetic 
organization, and the activities associated with particular open reading frames, modules, and submodules of the gene 
enables production of novel polyketides having a predicted structure that are not otherwise available. Modifications 

s may be made to the DNA sequence that either alter the initial carboxylic acid building block used or the building block 
added at any of the condensation steps. The tylactone synthase gene may also be modified to after the actual number 
of condensation steps done, thereby changing the size of the carbon backbone. Modifications to portions of the DNA 
sequence that encode the post -condensation processing activities will alter the functional groups appearing at the 
various condensation sites on the carbon chain backbone. 

10 These modifications can be accomplished by substituting submodules derived from the tylactone synthase gene 

and having known activities for corresponding submodules from another polyketide synthase gene having different 
activities. Submodules from tylactone synthase may also be combined with submodules from other polyketide synthase 
genes to effect additional catalytic steps. Accordingly, a DNA molecule wherein at least one submodule from the ty- 
lactone synthase gene has been combined with, or substituted for, submodules from the DNA sequence of other 

is polyketide synthase genes is also provided by the present invention. Further, submodules that are a part of the present 
invention may be selectively inactivated thereby giving rise to predictable novel polyketide structures. 

For example, a submodule encoding a KS 1 activity, an AT activity, and an ACP activity, all derived from the first 
open reading frame of the tylactone synthase gene (ORF1), were exchanged for the corresponding submodule in the 
- platenolide synthase gene (see Example 2). Platenolide is the polyketide backbone of the antibiotic spiramycin. 

20 The submodule from the tylactone synthase gene encodes a domain that catalyzes the incorporation of a propi- 

onate as the initial building block whereas the corresponding submodule of the platenolide synthase gene encodes a 
domain that catalyzes the incorporation of an acetate building block. When the resulting DNA molecule was placed 
into a bacterial strain and grown under conditions promoting polyketide synthesis, a hybrid molecule having the struc- 
ture that would be predicted by incorporation of an additional methyl side-chain at the start of the growing polyketide 

25 chain was synthesized, in particular, 16-methyl platenolide. 

One skilled in the art is fully familiar with the degeneracy of the genetic code. Consequently, the skilled artisan can 
modify the specific DNA sequences provided by this disclosure to provide proteins having the same or improved char- 
acteristics compared to those polypeptides specifically provided herein. Also, one skilled in the art can modify the DNA 
sequences to express an identical protein to those provided, albeit expressed at higher levels. Furthermore, one skilled 

30 in the art is familiar with means to prepare synthetically, either partially, or in whole, DNA sequences which would be 
useful in preparing recombinant DNA vectors or coding sequences which are encompassed by the current invention. 
Additionally, recombinant means for modifying the DNA sequences provided may include for example site-directed 
deletion or site-directed mutagenesis. These techniques are well known to those skilled in the art and require no further 
elaboration here. Consequently, as used herein, DNA which is isolated from natural sources, prepared synthetically 

35 or semi-synthetically, or which are modified by recombinant DNA methods, are within the scope of the present invention. 

Likewise, those skilled in the art will recognize that the polypeptides of the invention may be expressed recom- 
binantly. Alternatively, these polypeptides may be synthesized as well, either in whole or in part, by conventional known 
non-recombinant techniques; for example, solid-phase synthesis. Thus, the present invention should not be construed 
as necessarily limited to any specific vector constructions or means for production of the specific polyketide synthase 

40 molecules exemplified. These alternate means for preparing the present polypeptides are meant to be encompassed 
by the present invention. 

Many cyclized polyketides undergo glycosylation at one or more sites. Tylosin is a 16-membered cyclic lactone, 
tylactone, with three attached sugar residues. The process of converting tylactone to tylosin is will known in the art. 
The present invention also provides the information needed to synthesize novel tylosin-related polyketides based on 
45 tylactone. The principles have already been described above. In addition, any product resulting from post-transcriptional 
or post-translational modification in vivo or in vitro based on the DNA sequence information disclosed herein are meant 
to be encompassed by the present invention. 

The following examples are provided for exemplification purposes only and are not intended to limit the scope of 
the invention which has been described in broad terms above. 

50 

EXAMPLE 1 

The DNA sequence of the Streptomyces fradiae tylactone synthase gene, tylG, was obtained by sequencing the 
inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments, which when considered 
55 cumulatively span the entire region of tylG. All sequences representing tylG are fully contained in the overlapping 
cosmid subclones pSET506 and pSET507. 

In accord with the current invention, the sequence may now be obtained by subcloning and sequencing the DNA 
fragment designated by EcoRI restriction sites at 3.0 and 4.0 kb on the map presented in Figure 3, the fragment bounded 
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by the EcoRI site at 4.0 kb and the BamH\ site at 7.5 kb, the fragments bounded by BarrM\ sites at 7.5, 12.0, 23.4, 
27.3, 34.6, 39.5, and 50.5 kb. 

In order to obtain the tylG gene on a single DNA fragment, the 31 .6 kb DNA fragment bounded by the EcoRI site 
at 3.0 kb and the BamH) site at 34.6 kb is isolated from a partial digestion of pSET506 with the restriction enzymes 
EcoRI and BamH\. The 15.9 kb DNA fragment bounded by the BamHI sites at 34.6 and 50.5 kb is isolated from a 
partial digestion of pSET507 with the restriction enzyme BamU\. The resulting fragments are ligated and cloned in an 
appropriate recombinant DNA vector. Clones containing the correct orientation of the two ligated fragments are iden- 
tified by restriction enzyme site mapping. 

EXAMPLE 2 

Production of a polyketide hybrid of platenolide and tylactone 

The lactone rings of the polyketides platenolide and tylactone undergo an identical set of post -condensation 
processing steps. However, these two polyketides are synthesized by condensation of more than one type of carboxylic 
acid and the specific building blocks chosen for incorporation in the two pathways differ. The organization of the pla- 
tenolide gene cluster is shown in Figure 4 along with the accompanying synthetic pathway in Figure 5. The specific 
carboxylic acid substrates that are used for each condensation reaction and the post-condensation activities of plate- 
nolide synthesis are indicated. 

The DNA sequence of the Streptomyces ambofaciens platenolide synthase (srmG) genes was obtained by se- 
quencing inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments, which when 
considered together span the entire srmG region. All sequences representing srmG are fully contained in the overlap- 
ping cosmid clones pKC1080 and pKC1306 (Figure 6). The sequence can be obtained by subcloning and sequencing 
the fragments bounded by NnA sites at position 1 , 0.3 kb, 8.2 kb, 1 4. 1 kb, 20.2 kb, 29.5 kb, 31 .4 kb, 41 .1 kb, and 42.0 kb. 

In order to obtain the srmG region on a single fragment, the 25.0 kb fragment bounded by the NnA site at position 
1 and the Sfu\ site at 25.0 kb is isolated from a partial digestion of pKC1080 with restriction enzymes NnA and Sfu\. 
The 17.8 kb DNA fragment bounded by the SfiA sites at 25.0 kb and 42.8 kb is isolated by digestion of pKC1306 with 
the restriction enzyme SfiA. The resulting fragments are ligated and cloned in an appropriate recombinant DNA vector. 
Clones containing the correct orientation of the two ligated fragments are identified by restriction enzyme mapping. 

An exchange of tylactone polyketide synthase DNA with corresponding platenolide DNA was effected as follows 
in order to generate a novel polyketide structure. 

A strain of Streptomyces ambofaciens, the organism that produces platenolide, was constructed with most of ORF1 
deleted. This ORF1 -deficient strain produced no detectable platenolide. To confirm that the lack of ORF1 was the only 
deficiency in platenolide production, a construct containing ORF1, and not any functional activities of ORFs 2-5, was 
introduced into the ORF1 -deficient strain on a vector that contains the site-specific integration function from the strep- 
tomycete phage <|>C31. Integration of ORF1 at the <{>C31 att site restored spiramycin production to parental levels, 
confirming that ORF1 codes for a functional protein and that ORFs 2-5 are expressed in the ORF1 deficient strain. 

ORF1 of tylG is organized like srmG ORF1 . DNA coding for a KS\ an AT, and an ACP from (y/ORF1 was exchanged 
with the corresponding region from srm ORF1 by replacing an EcoR\-Apa\ fragment of sm? ORF1 with an EcoH\-SM 
fragment from fy/ORF1, generating plasm id pKC1524. When pKC1524 was introduced into the ORF1 -deficient strain, 
polyketide synthesis was restored. The products produced by this restored strain were indistinguishable from those 
produced by the parental strain of Streptomyces ambofaciens when analyzed by thin-layer chromatography and HPLC. 

The hybrid ORF1 DNA sequence is SEQ ID NO:7 and the amino acid sequence of the ORF1 -encoded polypeptide 
is SEQ ID NO:8. The rest of the DNA sequence of the hybrid molecule is identical to original tylG sequence and the 
polypeptides encoded by this remaining portion are therefore identical as well. 

In order to simplify physical analysis of the putative novel polyketide, an Xho\ fragment encoding sugar synthesis 
or addition was deleted from the restored. Deletion of this fragment led to a strain that produced only the lactone without 
any sugar residues added. When analyzed by NMR and mass spectroscopy, the isolated product made by this sugar- 
deficient strain was confirmed as 16-methylplatenolide, which is the structure that would be predicted from the incor- 
poration of a propionate in place of the acetate normally utilized in platenolide synthesis. 
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SEQUENCE LISTING 
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10 



(1) GENERAL INFORMATION : 

15 (i) APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(EJ COUNTRY: United States of America 
(F) ZIP: 46285 

20 

<ii) TITLE OF INVENTION: POLYKETIDE SYNTHASE GENES 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

25 (A) ADDRESSEE: K. G. TAPPING 

(B) STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU20 6PH 

30 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.0 

(D) SOFTWARE: Microsoft Word 5.1 

35 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUQICE CHARACTERISTICS: 

(A) LENCra: 43280 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

so (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 816.. 14234 

(ix) FEATURE: 
55 (A) NAME/KEY: CDS 

(B) LOCATION: 14351.. 19945 
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10 



20 



25 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 20010.. 31199 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 31232.. 36067 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 36249.. 41774 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:l: 

75 GAATTCGCGG TCCGCTCGGG TTCCGGTCCG TTTTCTGCTT CGAGCCTCTG TGTCGTCATC 60 

TCGGCTCTCT CATCGGGCTG GTGCGGGAAG GCATCCCGGT GCGGGGAGAC ATCCTGCTGC 120 

GGGAAGGAAT CCTGGTGCGG GAAGGCAACG ACTGCGGGAC GCGGGAGAAA GGGGAATCGG 180 

CGGCAATTTC CCCCGCGCGG CGGGGACGGT GCCGGAGAAC AACGGCGGGG AAACAGCCCG 240 

CGGTCCGTGA CGACAACGGA AACTATGGTC CGCTTCCTCC GTCCACAAGG CGGAACCTGA 300 

CATAGTCCCC GCCCACGCGG AAATCCCGCA CGGCGGCCCG CCGGCCGCCG CACCGGACCT 360 

GACATAGCCT CGCCGGACCG CTCCGGTGCG GCCACCCCGT TGGTGTTOGG TGATGAGGTA 420 

CCGGATCAGA GGAGAAAGCA CCATGCCCCG CCCCTCGGCC AGCGAACCGC GCGGGACCAC 480 

30 CCGTTCGGCG ACCGCGCTGG CGCGCCGCCG TGGACCGGGC CGTAACTCCC CTGCGCCATC 540 

GAATACTTCG CCCCTCGAAT CCCTCACCGG GCGAGTTCCA GGACCGCCCC TCGCTCTCGC 600 

CATACCGGAG AACGAACCCG AACGGCACGG CGGAAAGCCC GTCCGCAATG CCCGGGACAT 660 

35 TCCTGTGACC CGACAACACG GTTTGCCGAC ACGGTTCGCC GCAACGCTTG TTOGCAOGCT 720 

CACCGGCACG GCCCGCTGAC ACAGCTCGGT GACACGGCAG CCTGACOOGA AACCGCCGAA 780 

GCCTCTGGAG TCCTCGCACA TTCCGGAGAG AACAG GTG TCT TCC GCG CTG COG 833 
40 Met Ser Ser Ala Leu Arg 

1 5 

CGC GCG GTG CAA TCC AAC TGT GGC TAC GGA GAC CTC ATG ACC TCG AAC 881 
Arg Ala Val Gin Ser Asn Cys Gly Tyr Gly Asp Leu Met Thr Ser Asn 
45 10 15 20 

ACC GCT GCA CAG AAC ACC GGC GAC CAG GAA GAC CTC GAC GGT CCC GAC 929 
Thr Ala Ala Gin Asn Thr Gly Asp Gin Glu Asp Val Asp Gly Pro Asp 
25 30 35 

50 AGC ACA CAC GGT GGG GAG ATC GCC GTC GTG GGA ATG TCG TGC CGT CTG 977 

Ser Thr His Gly Gly Glu He Ala Val Val Gly Met Ser Cys Arg Leu 
40 45 50 

CCG GGC GCG GCC GGT GTC GAG GAA TTC TGG GAA CTG CTG CGC AGC GGA 1025 
55 Pro Gly Ala Ala Gly Val Glu Glu Phe Trp Glu Leu Leu Arg Ser Gly 

55 60 65 70 
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CGC GGT ATG CCC ACC CGT CAG GAC GAC GGC ACC TGG COG GCC GCC CTG 1073 
Arg Gly Met Pro Thr Arg Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu 
75 80 85 

GAG GAC CAC GCC GGC TTC GAC GCC GGG TTC TTC GGC ATG AAC GCC CGG 1121 
Glu Asp His Ala Gly Phe Asp Ala Gly Phe Phe Gly Met Asn Ala Arg 
90 95 100 

CAG GCC GCC GCC ACC GAC CCG CAG CAC CGA CTG ATG CTG GAA CTC GGA 1169 
Gin Ala Ala Ala Thr Asp Pro Gin His Arg Leu Met Leu Glu Leu Gly 
105 110 115 



TGG GAG GCA CTG GAG GAC GCG GGC ATC GTC CCC GGC GAT CTC ACC GGC 1217 
Trp Glu Ala Leu Glu Asp Ala Gly He Val Pro Gly Asp Leu Thr Gly 
« 120 125 130 

ACC GAC ACC GGA GTC TTC GCC GGC GTG GCG TCC GAC GAC TAT GCC GTT 1265 
Thr Asp Thr Gly Val Phe Ala Gly Val Ala Ser Asp Asp Tyr Ala Val 
135 140 145 150 

20 

CTC ACC CGC CGT TCC GCC GTC TCC GCC GGG GGA TAC ACC GCC ACG GGG 1313 
Leu Thr Arg Arg Ser Ala Val Ser Ala Gly Gly Tyr Thr Ala Thr Gly 
155 160 165 

CTG CAC CGC GCC CTG GCC GCC AAC CGC CTC TCC CAC TTC CTG GGC CTG 1361 
25 Leu His Arg Ala Leu Ala Ala Asn Arg Leu Ser His Phe Leu Gly Leu 

170 175 180 

CGC GGC CCC AGC CTG GTC GTC GAC TCG GCC CAG TCC GCC TCA CTG GTG 1409 
Arg Gly Pro Ser Leu Val Val Asp Ser Ala Gin Ser Ala Ser Leu Val 
30 185 190 195 

GCC GTC CAG CTC GCC TGC GAG AGT CTG CGC CGG GGT GAG ACG TCG CTC 1457 
Ala Val Gin Leu Ala Cys Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu 
200 205 210 

GCC GTC GCG GGC GGT GTC AAC CTC ATC CTC ACC GAG GAG AGC ACC ACC 1505 
Ala Val Ala Gly Gly Val Asn Leu He Leu Thr Glu Glu Ser Thr Thr 
215 220 225 230 

GTC ATC GAG CGT ATG GGA GCG CTC TCA CCC GAC GGC CGC TGC CAC ACC 1553 
40 Val Met Glu Arg Met Gly Ala Leu Ser Pro Asp Gly Arg Cys His Thr 

235 240 245 

TTC GAC GCC CGC GCC AAC GGC TAC GTA CGC GGC GAG GGC GGC GGA GCC 1601 
Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly Ala 
4S 250 255 260 

GTC GTG CTC AAG CCA CTG GAC GCC GCA CTC GCC GAC GGC GAC CGC GTG 1649 
Val Val Leu Lys Pro Leu Asp Ala Ala Leu Ala Asp Gly Asp Arg Val 
265 270 275 

TAC TGC GTC ATC AAG GGA GGT GCC GTC AAC AAC GAC GGC GGC GGC GCG 1697 
Tyr Cys Val He Lys Gly Gly Ala Val Asn Asn Asp Gly Gly Gly Ala 
280 285 290 

AGC CTC ACC ACT CCC GAC CGG GAG GCG CAG GAA GCT GTG CTG CGC CAG 1745 
55 Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin Glu Ala Val Leu Arg Gin 

295 300 305 310 



9 



EP0 791 655 A2 



GCC TAC CGG CGG GCG GGC GTC AGC ACC GGC GCC GTC CGC TAC GTC GAG 1793 
Ala Tyr Arg Arg Ala Gly Val Ser Thr Gly Ala Val Arg Tyr Val Glu 
315 320 325 

CTG CAC GGG ACC GGC ACC CGG GCC GGC GAC CCC GTC GAG GCG GCC GCA 1841 
Leu His Gly Thr Gly Thr Arg Ala Gly Asp Pro Val Glu Ala Ala Ala 
330 335 340 

10 CTG GGC GCC GTG CTC GGG GCG GGG GCG GAC AGC GGC CGC AGC ACG CCG 1889 

Leu Gly Ala Val Leu Gly Ala Gly Ala Asp Ser Gly Arg Ser Thr Pro 
345 350 355 



15 



20 



25 



35 



40 



50 



55 



CTC GCC GTC GGC TCG GTG AAG ACC AAC GTC GGC CAT CTG GAG GGC GCG 1937 
Leu Ala Val Gly Ser Val Lys Thr Asn Val Gly His Leu Glu Gly Ala 
360 365 370 

GCG GGC ATC GTC GGA CTG ATC AAG GCC ACG CTG TCC GTA CGG AAG GGC 1985 
Ala Gly lie Val Gly Leu lie Lys Ala Thr Leu Cys Val Arg Lys Gly 
375 380 385 ~ 390 

GAA CTC GTC CCC AGC CTC AAC TTC AGC ACG CCG AAC CCT GAC ATC CCC 2033 
Glu Leu Val Pro Ser Leu Asn Phe Ser Thr Pro Asn Pro Asp He Pro 
395 400 * 405 

CTC GAC GAC CTG CGG CTG CGC GTC CAG ACC GAA CGG CAG GAG TGG AAC 2081 
Leu Asp Asp Leu Arg Leu Arg Val Gin Thr Glu Arg Gin Glu Trp Asn 
410 415 420 



GAG GAG GAC GAC CGG CCG CGC GTG GCC GGC GTC TCC TCC TTC GGT ATG 2129 
Glu Glu Asp Asp Arg Pro Arg Val Ala Gly Val Ser Ser Phe Gly Met 
30 425 430 435 

GGC GGA ACC AAT GTC CAC CTC GTG ATC GCG GAG GCT CCG GCC GCG GCG 2177 
Gly Gly Thr Asn Val His Leu Val He Ala Glu Ala Pro Ala Ala Ala 
440 445 450 



GGG TCC TCC GGG GCG GGG GGT TCG GGC GCT GGT TCC GGT GCC GGT ATC 2225 
Gly Ser Ser Gly Ala Gly Gly Ser Gly Ala Gly Ser Gly Ala Gly He 
455 460 465 470 

AGC GCT GTT TCT GGT GTG GTC CCG GTC GTC GTT TCG GGG CGT TCG CGG 2273 
Ser Ala Val Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg 
475 480 485 



GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC 2321 
Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala 
45 490 495 500 

GGT GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GAC CGG TCG 2369 
Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Asp Arg Ser 
505 510 515 



CGG TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT 2417 
Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu 
520 525 530 

GCC GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG 2465 
Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val 
535 540 545 & 550 
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GTC ACC GGT GCG GTT CTC GAC GGT GGT GTG GTT GTC GGT GCT GCC CCC 2513 

Val Thr Gly Ala Val Leu Asp Gly Gly Val Val Val Gly Ala Ala Pro 
555 560 565 

GGC GGT GCC GGT GCT GCC GGT GGT GCC GGT GCT GCC GGT GGT GCC GGT 2561 

Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
570 575 580 

GGT GGG GGC GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TOG GTC 2609 

Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Tip Val 

585 590 595 



GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 2657 
Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
* 5 600 " 605 610 

ATG CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTG GGG TGG GAT TTG 2705 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
615 620 625 630 

20 

CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG 2753 
Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
635 640 645 

CAG CCG GTC ACG TCG GCG GTG ATG GTC TCG CTG GCC CGG TAC TCG CAG 2801 
25 Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 

650 655 660 

GCG ATG GGT GTC GAC GTC GCT GCG GTC GTC GGT CAT TCC CAG GGG GAG 2849 
Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu 
30 665 670 675 

ATC GCC GCT GCC ACG GTC GCG GGG GCG TTG TCG CTC GAG GAT GCG GCG 2897 
He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
680 685 690 

35 GCT GTC GTC GCT CTC CGG GCG GGG TTC ATT GGC CGG TAT CTC GCG GGT 2945 

Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly 
695 700 705 710 

CGT GGT GCG ATC GCG GCT GTT CCG CTC CCT GCC GGC GAG GTC GAG GCC 2993 
40 Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala 

715 720 725 

GGG CTC GCG AAG TCG CCG GGT GTC GAG GTC GCG GCG GTC AAC GGT CCG 3041 
Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro 
45 730 735 740 

GCG TCT ACG GTC GTT TCC GGG GAT CGG CGG GCG GTC GCC GGT TAT GTC 3089 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val 
745 750 755 

50 GCC GTC TCT CAG GCG GAG GGT GTG CAG GCC CGG TTG ATA CCG GTC GAC 3137 

Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro Val Asp 
760 765 770 

TAC GCC TCT CAC TCC CGC CAT GTC GAG GAC CTC AAG GGC GAG TTC GAG 3185 
55 Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu 

775 780 785 790 
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CGG GTG CTC TCC OCT ATC CGC CCC CGC AST CCG COG GTG CCG GTC TGT 
Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro Val Cys 
795 800 805 

TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG 
Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly 
810 815 820 

TAT TGG TTC CGT AAT CTC CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC 
Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val 
825 830 835 

GGT GGT TTG TTG GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC 
Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala 
840 845 850 

CAC CCG GTA CTC GTC CAT GCG ATC GAG CAG ACG GCC GAG GCC GCG GAC 
His Pro Val Leu Val His Ala He Clu Gin Thr Ala Glu Ala Ala Asp 
855 . 860 865 870 

CGG AGT GTC CAT GCC ACC GGG ACC CTG CGC CGC CAG GAC GAC AGC CCG 
Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro 
875 880 885 

CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC 
His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr 
890 895 900 

CTC ACC TGG GAC CCC GCC CTG CCC CCA GGC CAC CTC ACC ACC CTC CCC 
Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro 
905 910 915 

ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG CTC GAC ACC ATT GAC GGG 
Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr He Asp Gly 
920 925 930 

GGC GGA GGG GAC GAC GCG ACC CAG GAG AAG GAG AGC GGC CCT CTG ACG 
Gly Gly Gly Asp Asp Ala Thr Gin Glu Lys Glu Ser Gly Pro Leu Thr 
935 940 945 950 

CGG GAA CTG CGT GGG CTG CCG TCC TCT CAG AAG CAA CTG GGT TTC CTC 
Arg Glu Leu Arg Gly Leu Pro Ser Ser Gin Lys Gin Leu Gly Phe Leu 
*55 960 965 

CTC GAT CTG GTC TGC CGG CAC ACG GCC GTC GTA CTC GGC CTG GAC ACG 
Leu Asp Leu Val Cys Arg His Thr Ala Val Val Leu Gly Leu Asp Thr 
970 975 980 

GCC GCC GAG GTC GAC CCG GAC CTG TCC TTC AAG AAG CAG GGC ATC CAG 
Ala Ala Glu Val Asp Pro Asp Leu Ser Phe Lys Lys Gin Gly He Gin 
985 990 995 

TCC ATC ACC GGC GTC GAG CTC CGC AAC AGG CTC CTC ACC GAG ACC GGC 
Ser Met Thr Gly Val Glu Leu Arg Asn Arg Leu Leu Thr Glu Thr Gly 
1000 1005 1010 

CTC GCA TTC CCC ACC ACC CTC GTC TAC GAC CGG CCC ACC CCT CGC GCC 
Leu Ala Leu Pro Thr Thr Leu Val Tyr Asp Arg Pro Thr Pro Arg Ala 
1015 1020 1025 1030 



3233 



3281 



3329 



3377 



3425 



3473 



3521 



3569 



3617 



3665 



3713 



3761 



3809 



3857 



3905 
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CTG GCG CAG TTC CTC CAC ACC GAG TTC CTC GAC GGC TCC CCC TCG GGC 3953 
Leu Ala Gin Phe Leu His Thr Glu Leu Leu Asp Gly Ser Pro Ser Gly 
1035 1040 1045 

5 

TCC GTC CTC GCA CCG GCG CAG AAG AGC TTC GAA GCC CAG GAG CCG ATC 4001 
Ser Val Leu Ala Pro Ala Gin Lys Ser Phe Glu Ala Gin Glu Pro lie 
1050 1055 1060 

w GCG GTG GTG GGT ATG GGG TGC CGG TTC CCC GGT GGG GTG GGT TCG CCG 4049 

Ala Val Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro 
1065 1070 1075 

GAG GCG TTG TGG CGG TTG GTG GTG GAG GGG GTG GAC GCG GTT TCC CCG 4097 
Glu Ala Leu Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro 
is 1080 1085 1090 

TTT CCC GGT GAT CGT GGC TGG GAT GTC GAG GGG TTG TAC GAC CCG GAG 4145 

Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu 
1095 " 1100 1105 1110 

20 

CCG GGT GTG GCG GGG AAG TCG TAT GTG CGG GAG GGG GGT TTT CTG CAT 4193 

Pro Gly Val Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His 
1115 1120 1125 

GAT GCG GCG GAG TTC GAT GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG 4241 
25 Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly lie Ser Pro Arg Glu 

1130 1135 1140 

GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TGG 4289 
Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
30 1145 1150 1155 

GAG GCG ATC GAG CGG GCG GGT ATC GAC CCG CAC TCG CTG CAC GGC AGC 4337 
Glu Ala lie Glu Arg Ala Gly lie Asp Pro His Ser Leu His Gly Ser 
1160 1165 1170 

35 CGC ACC GGC GTC TAC GCC GGC GTG ATG CCG CAG GAA TAC GGA CCT CGG 4385 

Arg Thr Gly Val Tyr Ala Gly Val Met Pro Gin Glu Tyr Gly Pro Arg 
1175 1180 1185 1190 

CTC GCC GAA GGA GCG GAA GGC AGC GAC GGC TAC CTC CTC ACC GGT ACG 4433 
40 Leu Ala Glu Gly Ala Glu Gly Ser Asp Gly Tyr Leu Leu Thr Gly Thr 

1195 1200 1205 

TCG GGG AGT GTG GTG TCG GGG CGT GTG GCC TAC ACG CTG GGG CTG GAG 4481 

Ser Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu 
1210 1215 1220 

45 

CGT CCG GCC GTG ACC GTG GAT ACG GCG TGT TCG TCG TCG TTG GTG GCG 4529 

Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 

1225 1230 1235 

TTG CAT CTG GCG GTG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATG GCG 4577 
Leu His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala 
1240 1245 1250 

TTG GCC GGT GGT GTG ACG GTG ATG GCC GGG CCG GGG ATG TTC GTG GAG 4625 
55 Leu Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu 

1255 1260 1265 1270 
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TTT TCG CGG CAG CGG GGG TTG GCG GCC GAT GGG CGG TGC AAG GCG TTC 4673 
Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe 
1275 1280 1285 

5 

GCG GAT GGG GCG GAT GGG ACC GCT TGG GCC GAG GGT GCG GGG GTG GTG 4721 
Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val 
1290 1295 1300 

10 CTC GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG CAT CCG GTG TTG 4769 

Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu 
1305 1310 1315 

GCG GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT GCG TCG AAT GGT 4817 
Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
1320 1325 1330 
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TTC ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG ATT CGT CAG GCG 4865 
Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala 
1335 1340 1345 " 1350 

TTG GGG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT GTG GTG GAG GCG 4913 
Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala 
1355 1360 1365 

CAT GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG GCG CAG GCG TTG 4961 
His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1370 1375 1380 

CTG GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG CTC TGG TTG 5009 
Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu 
1385 1390 1395 

GGG TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTG 5057 
Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val 
1400 1405 1410 

GCT GGT GTG ATC AAG ATC GTG TTG GCG ATG CGG TAT GGG TGG TTC CCG 5105 
Ala Gly Val He Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro 
H15 1420 1425 1430 

CGG ACG TTC CAT GTG GAT GAG CCG AGC CGG CAT GTG GAC TGG TCG GCT 5153 
Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala 
1435 1440 1445 

GGT GGT GTG TGG TTC CTC ACC GAG GCG CGG GAG TGG CCG GGG GTC GAC 5201 
Gly Gly Val Trp Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp 
1450 1455 1460 

CGG CCG CGT CGG GCG GCG GTC TCC GCC TTT GGT GTC AGT GGT ACC AAC 5249 
Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn 
1465 1470 1475 

GCC CAT CTC ATC CTC GAA GCC CCC GAC ACC GCC GAG GCG GAG AGC GCC 5297 
Ala His Leu He Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala 
1480 1485 1490 

ACG ACC CCG GTC CGC TCT GAG GTC TCG GAG TCT GCT GCG GTC CTC GAT 5345 
Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala Ala Val Leu Asp 
1495 1500 1505 1510 
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GCC CGC AGT GGT GTG GTC CCG GTG GTG GTT TCG GGG CGT TCG CGG GTG 5393 
Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val 
1515 1520 1525 

GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC GGT 5441 
Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly 
1530 1535 1540 

GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GGC CGG TCG CGG 5489 
Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg 
1545 1550 1555 



TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT GCC 5537 
Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala 
15 1560 1565 1570 

GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG GTC 5585 
Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val 
1575 ~ 1580 1585 1590 

20 

ACC GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT GGT GGG GGC GTG 5633 
Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val 
1595 1600 1605 

GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG 5681 
25 Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala 

1610 1615 1620 

GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT 5729 
Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys 
30 1625 1630 1635 

GCG CGG GCG CTG AGT GTT CAT GTG GAG TGG GAT TTG CTG GAG GTG GTG 5777 
Ala Arg Ala Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val 
1640 1645 1650 

35 TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG CAG CCC GTG ACG 5825 

Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr 
1655 1660 1665 1670 

TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG GCG ATG GGT GTG 5873 
40 Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val 

1675 1680 1685 

GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG ATC GCT GCT GCC 5921 
Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala 
45 1690 1695 1700 

ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT 5969 
Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala 
1705 1710 1715 

50 CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG 6017 

Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met 
1720 1725 1730 

GCG GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG 6065 
55 Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys 

1735 1740 1745 1750 
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TGG CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG GCG TCC ACG GTG 6113 
Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val 
1755 1760 1765 

GTT TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTG GCC GTC TCT CAG 6161 
Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin 
1770 1775 1780 

GCG GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC 6209 
Ala Glu Gly Val Gin Ala Arg Leu lie Pro Val Asp Tyr Ala Ser His 
1785 1790 1795 

TCC CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC 6257 
Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser 
1800 1805 1810 
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GGT ATC CGC CCC CGC AGT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC 6305 
Gly lie Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala 
1815 1820 1825 1830 

GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG TAT TGG TTC CGT 6353 
Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg 
1835 1840 1845 

AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG 6401 
Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu 
1850 1855 I860 

GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC 6449 
Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu 
1865 1870 1875 

GTC CAT GCG ATC GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT 6497 
Val His Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His 
1880 1885 1890 

GCC ACC GGG ACC CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTG 6545 
Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu 
1895 1900 1905 1910 

ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TGG GAC 6593 
Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp 
1915 1920 1925 

CCC GCC CTG CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC 6641 
Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe 
1930 1935 1940 

AAC CAC CAC CAC TAC TGG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG 6689 
Asn His His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr 
1945 1950 1955 

ACC ACC CAG AGC CCC ACC GAT GCC TGG CGC TAC CGC GTC ACC TGG AAA 6737 
Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg Tyr Arg Val Thr Trp Lys 
I960 1965 1970 

GCC CTG ACC GAA TCC TCC CCC GTC CGC CCT CAC TCC ATC GGT CGC TCC 6785 
Ala Leu Thr Glu Ser Ser Pro Val Arg Pro His Ser He Gly Arg Cys 
1^75 1980 1985 ^ 1990 
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35 



CTC CTC GTT GCA CCC CCG ACC ACC GAC GGC GAG CTC CTC GAC GGA CTG 6833 
Leu Leu Val Ala Pro Pro Thr Thr Asp Gly Glu Leu Leu Asp Gly Leu 
1995 2000 2005 

ACA ACG GTC TTG TCC GAG CGC GGT GCC TCC GTC GCC CGC CTT GAG GTG 6881 
Thr Thr Val Leu Ser Glu Arg Gly Ala Ser Val Ala Arg Leu Glu Val 
2010 2015 2020 

CCC ATC GGC GCG CGG CGT GCC GAG GTC GCC GAA CTG CTC AAG CCC TCC 6929 
Pro He Gly Ala Arg Arg Ala Glu Val Ala Glu Leu Leu Lys Pro Ser 
2025 2030 2035 



ATG GAG TCA GCG GGG GAG GAG AAC ACC ACC GTC GTC TCG CTT CTC GGT 6977 
Met Glu Ser Ala Gly Glu Glu Asn Thr Thr Val Val Ser Leu Leu Gly 
15 2040 2045 2050 

CTG GTG CCC TCC ACG GAC GCG GTC AGG ACG TCG ATA GCG CTC CTC CAG 7025 

Leu Val Pro Ser Thr Asp Ala Val Arg Thr Ser He Ala Leu Leu Gin 
2055 2060 2065 2070 

20 

GCG GTC TCC GAC ATC GGC GTC CCG GCC GCC AGG GTC TCG GCG CTG ACG 7073 

Ala Val Ser Asp He Gly Val Pro Ala Ala Arg Val Trp Ala Leu Thr 
2075 2080 2085 

CGG AGG GCC GTG GCC GTG GTT CCC GGG GAG ACG CCG CAG GAC GCG GGG 7121 
25 Arg Arg Ala Val Ala Val Val Pro Gly Glu Thr Pro Gin Asp Ala Gly 

2090 2095 2100 

GCC CAG TTG TGG GGC TTC GGA CGA GTC GCG GCC CTT GAA CTC CCG GAT 7169 
Ala Gin Leu Trp Gly Phe Gly Arg Val Ala Ala Leu Glu Leu Pro Asp 
30 2105 2110 2115 

ATC TCG GGC GGC TTC ATC GAT CTC CCG GAG ACA GCG GAG CTC ACG CGG 7217 
He Trp Gly Gly Leu He Asp Leu Pro Glu Thr Ala Glu Leu Thr Arg 
2120 2125 2130 

ACG CCG GAG ACC TCA CAG CCC CCA CAG ACC CCG GAG AGG CTG CCG CAG 7265 
Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr Pro Glu Arg Leu Pro Gin 
2135 2140 2145 2150 

ACT CCG AAC CGA CGC GCC CTT GAG CTT GCT GCC GCC GTC CTC GCC GGC 7313 
40 Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala Ala Ala Val Leu Ala Gly 

2155 2160 2165 

CGC GAC GGC GAG GAC CAG GTC GCC GTC CGC GCC TCG GGG ATC TAC GGG 7361 
Arg Asp Gly Glu Asp Gin Val Ala Val Arg Ala Ser Gly He Tyr Gly 
45 2170 2175 2180 

CGG CGG GTG TCG CGG GCC GCG GCA GCG GGG GCC GCC TCC TGG CAG CCG 7409 
Arg Arg Val Ser Arg Ala Ala Ala Ala Gly Ala Ala Ser Trp Gin Pro 
2185 2190 2195 

50 TCC GGC ACG GTG CTC ATC ACC GGC GGC ATC GGT GCC ATC GGC AGG CGG 7457 

Ser Gly Thr Val Leu He Thr Gly Gly Met Gly Ala He Gly Arg Arg 
2200 2205 2210 

CTC GCC CGC AGG CTC GCG GCC GAG GGA GCC GAA CGC CTC GTC CTC ACC 7505 
55 Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala Glu Arg Leu Val Leu Thr 

2215 2220 2225 2230 
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AGC CGT CGC GGA CCG GAG GCG CCG GGC GCC GCC GAA CTC GCC GAG GAA 
Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala Ala Glu Leu Ala Glu Glu 
2235 2240 2245 

CTG CGA GGA CAT GGC TGC GAG GTC GTG CAC GCG GCC TGT GAC GTG GCC 
Leu Arg Gly His Gly Cys Glu Val Val His Ala Ala Cys Asp Val Ala 
2250 2255 2260 

GAG CGT GAT GCG CTC GCC GCG CTC GTC ACC GCG TAT CCG CCG AAC GCC 
Glu Arg Asp Ala Leu Ala Ala Leu Val Thr Ala lyr Pro Pro Asn Ala 
2265 2270 2275 

GTC TTC CAC ACC GCC GGG ATT CTG GAC GAC GCG GTG ATC GAC ACG CTG 
Val Phe His Thr Ala Gly He Leu Asp Asp Ala Val He Asp Thr Leu 
2280 2285 2290 

TCA CCG GAG AGC TTC GAG ACC GTC CGC GGG GCG AAG GTG TGC GGC GCG 
Ser Pro Glu Ser Phe Glu Thr Val Arg Gly Ala Lys Val Cys Gly Ala 
2295 2300 2305 2310 

GAG CTG CTG CAC CAA CTG ACT GCG GAC ATA AAA GGG TTG GAC GCC TTC 
Glu Leu Leu His Gin Leu Thr Ala Asp He Lys Gly Leu Asp Ala Phe 
2315 2320 ~ 2325 

GTC CTC TTC TCC TCC GTC ACC GGC ACA TGG GGC AAC GCC GGC CAG GGT 
Val Leu Phe Ser Ser Val Thr Gly Thr Trp Gly Asn Ala Gly Gin Gly 
2330 2335 2340 

GCG TAC GCC GCC GCC AAC GCC GCG CTC GAC GCC CTC GCC GAG CGT CGC 
Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Arg Arg 
2345 2350 2355 

CGT GCC GCC GGA CTG CCC GCG ACC TCC GTC GCC TGG GGC CTT TGG GGC 
Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Gly 
2360 2365 2370 

GGG GGA GGC ATG GCG GCG GGT GCG GGC GAG GAG AGT CTG TCG CGG CGA 
Gly Gly Gly Met Ala Ala Gly Ala Gly Glu Glu Ser Leu Ser Arg Arg 
2375 2380 2385 2390 

GGG CTG CGG GCC ATG GAC CCC GAC GCG GCC GTC GAC GCG CTC CTG GGC 
Gly Leu Arg Ala Met Asp Pro Asp Ala Ala Val Asp Ala Leu Leu Gly 
2395 2400 2405 

GCC ATG GGC AGG AAC GAC GTG TGC GTC ACT GTC GTC GAC GTC GAC TGG 
Ala Met Gly Arg Asn Asp Val Cys Val Thr Val Val Asp Val Asp Trp 
2410 2415 2420 

GAG CGT TTC GCG CCC GCG ACG AAC GCC ATC CGT CCC GGG CGG CTG TTC 
Glu Arg Phe Ala Pro Ala Thr Asn Ala He Arg Pro Gly Arg Leu Phe 
2425 2430 2435 

GAC ACC GTG CCG GAG GCG CGG GAG GCC CTG ACG GCA GCC GGC ACC ACG 
Asp Thr Val Pro Glu Ala Arg Glu Ala Leu Thr Ala Ala Gly Thr Thr 
2440 2445 2450 

TCC GCG ACG CCG GAC GGC GCG CCC GAG CTG GCG CGG CGG TTG TCC ATG 
Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu Ala Arg Arg Leu Ser Met 
2455 2460 2465 2470 
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25 



30 



CTG AAC GAG ACC GAA CGC CTG COG AAG CTG GTC GAA CTC GTC CGT ACC 8273 
Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu Val Glu Leu Val Arg Thr 
2475 2480 2485 

GAG GCG GCC TTT GTG CTG CGG CAT CCG AAC ACG GAC GCC ATC GGC GCC 8321 
Glu Ala Ala Phe Val Leu Arg His Pro Asn Thr Asp Ala lie Gly Ala 
2490 2495 2500 

GAA CGC CCG TTC AAG TCG GCC OGT TTC GAC TCC CTG ACC TCC CTG GAA 8369 
Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp Ser Leu Thr Ser Leu Glu 
2505 2510 2515 



CTC CGC AAC CGC CTC AAT GCC GGC ACA GGC CTG AAG CTA CCC GCC ACC 8417 
Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly Leu Lys Leu Pro Ala Thr 
IS 2520 2525 2530 

GTC ATC TTC GAC CAC CCC AGC CCG ACC GCC CTG GCC AGA CTG CTG CTC 8465 

Val lie Phe Asp His Pro Ser Pro Thr Ala Leu Ala Arg Leu Leu Leu 

2535 * 2540 2545 2550 

20 

GAC CGG CTG ACC GGC GCC GGA GCC CCC GCG CCC GCC GCC GAT GAG CCG 8513 

Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala Pro Ala Ala Asp Glu Pro 
2555 2560 2565 

CCA CTG CCC GTC GCC GTG GCC GAC GAC GAC CCG GTG GTC ATC GTC GGC 8561 
Pro Leu Pro Val Ala Val Ala Asp Asp Asp Pro Val Val He Val Gly 
2570 2575 2580 

ATG GCG TGC CGT TTC CCC GGT GGG GCG GGC ACC CCT GAG GCG CTG TGG 8609 
Met Ala Cys Arg Phe Pro Gly Gly Ala Gly Thr Pro Glu Ala Leu Trp 
2585 2590 2595 

AAG CTG GTG ACC GAG GAG CGT GAC GTC ATA GGC GCC GCG CCC ACC GAC 8657 
Lys Leu Val Thr Glu Glu Arg Asp Val He Gly Ala Ala Pro Thr Asp 
2600 2605 2610 

35 CGG GGC TGG GAC CTG GAT TCC GTC TAC GAC CCG GAG CCG GGT GTG GCG 8705 

Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp Pro Glu Pro Gly Val Ala 
2615 2620 2625 2630 

GGG AAG ACA TAT GTG CGG GAG GGG GGT TTT CTC CAC GAC GCG GCG GAG 8753 
40 Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 

2635 2640 2645 

TTC GAC GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG GCG GTG GCG ATG 8801 
Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala Met 
2650 2655 2660 

45 

GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TGG GAG GCG ATC GAG 8849 
Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He Glu 
2665 2670 2675 

*> CGG GCG GGT ATC GAC CCG CAC TCG CTG CAC GGC AGC CGC ACC GGG GTA 8897 

Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser Arg Thr Gly Val 
2680 2685 2690 

TAC GTC GGG CTG ACC CAT CAG GAG TAC GCC TCC CGG CTG CAC GAG GCC 8945 
55 Tyr Val Gly Leu Thr His Gin Glu Tyr Ala Ser Arg Leu His Glu Ala 

2695 2700 2705 2710 
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GTG GAT GAG CCG AGC CGG CAT GTG GAC TGG TCG GCT GGT GGT GTG CGG 
Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Arg 
2955 2960 2965 



9713 



TTG CTG ACC GAG GCG CGG GAG TGG CCG GGG GTG GAC CGG CCG CGT CGG 
Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Arg 
2970 2975 2980 



9761 



w 



GCG GCG GTC TCC GCC TTC GGT GTC AGT GGT ACC AAC GCC CAT CTG ATC 
Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Leu lie 
2985 2990 2995 



9809 



is 



CTC GAA GCC CCC GAA GCC CTC GAA GCC CTC GAA GCC ACC GAC GCC CCC 
Leu Glu Ala Pro Glu Ala Leu Glu Ala Leu Glu Ala Thr Asp Ala Pro 
3000 3005 3010 



9857 



20 



GAA GCC CCC GAA GCC CCC GAA GCC CCC GAC GTC ACC GAC GTC ACC GAA 9905 
Glu Ala Pro Glu Ala Pro Glu Ala Pro Asp Val Thr Asp Val Thr Glu 
3015 3020 3025 3030 

GCC CTC GAA GCC CCC GAC GCC ACC GAG GCG GAG GGT GCG AAG GCT CCT 9953 
Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala Glu Gly Ala Lys Ala Pro 
3035 3040 3045 



25 



GGC AGT CCC GAA GAG GCA CAG CCT GCT GTG GGT GTG GTG CCG GTG GTG 
Gly Ser Pro Glu Glu Ala Gin Pro Ala Val Gly Val Val Pro Val Val 
3050 3055 3060 



10001 



30 



GTT TCG GGG CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG 
Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu 
3065 3070 3075 



10049 



35 



GCG GAG GTG GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG GCG GTG 10097 
Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val 
3080 3085 3090 

ACG ATG GCG GGC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG CTG GCT 10145 
Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 
3095 3100 3105 3110 



CGG GGT GAG GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG GGG GGT 
40 Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly 

3115 3120 3125 



10193 



45 



50 



GAT CCG GAC GCG GGT GTG GTC ACC GGT GCG GTG GTG GAC CCG GAG ACG 10241 

Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr 
3130 3135 3140 

GGG TCC GGT GGT GGG GGC GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG 10289 

Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr 
3145 3150 3155 

CAG TGG GTG GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT 10337 

Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe 
3160 3165 3170 



GCG GCG TCG ATG CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTG GAG 
55 Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Glu 

3175 3180 3185 3190 



L0385 
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TGG GAT TTG CTG GAG GTG GTG TCG GGC GOG GCC GGG TTG GAG CGG GTG 10433 
Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val 
3195 3200 3205 

GAT GTG GTG CAG CCC GTG ACG TGG GCG GTG ATG GTG TCG CTG GCC CGG 10481 
Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg 
3210 3215 3220 

TAC TGG CAG GCG ATG GGT GTG GAC GTG GCT GCG GTG GTG GGT CAT TCC 10529 
Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser 
3225 3230 3235 



15 



CAG GGG GAG ATC GCT GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG 
Gin Gly Glu lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu 
3240 3245 3250 



10577 



20 



GAT GCG GCG GCT GTG GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT 10625 
Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu lie Gly Arg Tyr 
3255 3260 3265 3270 

CTG GCG GGT CGT GGT GCG ATG GCG GCT GTT CCG CTG CCT GCC GGC GAG 10673 
Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu 
3275 3280 3285 



25 



30 



GTC GAG GCC GGG CTG GCG AAG TGG CCC GGA GTA CAG GTA GCC GCG GTC 10721 
Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Gin Val Ala Ala Val 
3290 3295 3300 

AAC GGT CCG GCG TCC ACG GTG GTT TCC GGG GAT CGG CGG GCG GTG GCC 10769 
Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala 
3305 3310 3315 



35 



40 



45 



GGT TAT GTG GCC GTC TGT CAG GCG GAG GGT GTG CAG GCT CGG* TTG ATA 10817 
Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu lie 
3320 3325 3330 

CCG GTG GAC TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC 10865 
Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly 
3335 3340 3345 3350 

GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC CGC AGT CCG CGG GTG 10913 
Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val 
3355 3360 3365 

CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC 10961 
Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe 
3370 3375 3380 

GAT GCG GGG TAT TGG TTC CGT AAT CTG CGG AAC CGG GTT GAG TTC TCC 11009 
Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser 
3385 3390 3395 



50 GCG GTG GTC GGT GGT TTG TTG GAG CAG GGC CAC CGT CGG TTC ATC GAG 11057 

Ala Val Val Gly Gly Leu Leu Glu Gin Gly His Arg Arg Phe Tie Glu 
3400 3405 3410 

GTC AGT GCC CAC CCG GTA CTC GTC CAT GCC ATT GAG CAG ACG GCC GAG 11105 
55 Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu 

3415 3420 3425 3430 
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GCC GCG GAC CGG ACT GTC CAT GCC ACC GGA ACC CTG CGC CGC CAG GAC 
Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp 
3435 3440 3445 



11153 



GAC AGC CCG CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TGG GCC CAC 
Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His 
3450 3455 3460 



11201 



10 



GGC GCC ACC CTC ACC TGG GAC CCC GCC CTG CCC CCA GGT CAC CTC ACC 
Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr 
3465 3470 3475 



11249 



15 



ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG GCC GTG ACA 
Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Ala Val Thr 
3480 3485 3490 



11297 



20 



25 



TCC CCC GCC GGA GTC GGC GAC GCG GCT GCG GGC CGG TTC GGT ATG ACC 11345 
Ser Pro Ala Gly Val Gly Asp Ala Ala Ala Gly Arg Phe Gly Met Thr 
3495 3500 3505 3510 

TGG GAG GAC CAC CCC TTC CTC CGT GGC GGG TTA CCC CTG GCC GAC TCC 11393 
Trp Glu Asp His Pro Phe Leu Arg Gly Gly Leu Pro Leu Ala Asp Ser 
3515 3520 3525 



GGT GAG CGG GTG TTC GCC GGG CGG CTG GCG GGC TCC GAG CAC GAC TGG 
Gly Glu Arg Val Phe Ala Gly Arg Leu Ala Gly Ser Glu His Asp Trp 
3530 3535 3540 



11441 



30 



35 



45 



50 



CTG ACG GAC CAT GCC GTG TCC GGG GTG ACG TTG CTG CCC GGT ACG GCC 
Leu Thr Asp His Ala Val Ser Gly Val Thr Leu Leu Pro Gly Thr Ala 
3545 3550 3555 



GGT GTG CGG GTG CAG ATG AGG GTG TOG GCC GCC GAC GAG TOG GGA CGG 
40 Gly Val Arg Val Gin Met Arg Val Ser Ala Ala Asp Glu Ser Gly Arg 

3595 3600 3605 



ACG CTC GTG CCG GAC CCG GAG CCC ACG CCT CCG GAC GCC GAC TGG GCG 
Thr Leu Val Pro Asp Pro Glu Pro Thr Pro Pro Asp Ala Asp Trp Ala 
3640 3645 3650 



11489 



TTC GTG GAG TTC GCG CTG CAC GCG GGA GCC GCC ACC GGC TGC GGG CGG 11537 
Phe Val Glu Phe Ala Leu His Ala Gly Ala Ala Thr Gly Cys Gly Arg 
3560 3565 3570 

CTG GAA GAG CTG AGC GTT GAG GCG CCG TTG GTC TTG CCC GCC GCC GGT 11585 
Leu Glu Glu Leu Ser Val Glu Ala Pro Leu Val Leu Pro Ala Ala Gly 
3575 3580 3585 3590 



11633 



CGG AGG GTC GCC ATC CAC TCG GCC CCG GAA GCC GCC GTC CAC TCG GCC 11681 
Arg Arg Val Ala lie His Ser Ala Pro Glu Ala Ala Val His Ser Ala 
3610 3615 3620 

GCA GAA GGC GGC GAC TCG GCC GGT GTC TGG ACG CGG CAC GGC GAG GGC 11729 
Ala Glu Gly Gly Asp Ser Ala Gly Val Trp Thr Arg His Gly Glu Gly 
3625 3630 3635 



11777 



CGG GCC TGG CCG CCC GCC GGG GAA CGC GTC GAA CCG GCC GAG CTC TAC 
55 Arg Ala Trp Pro Pro Ala Gly Glu Arg Val Glu Pro Ala Glu Leu Tyr 

3655 3660 3665 3670 



11825 
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GAA CGG TTC GGG GCC CTG GGC TAC GAG TAC GGT GAG GCG TTC GCG GGC 11873 
Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr Gly Glu Ala Phe Ala Gly 
3675 3680 3685 

GTG CGC GCC GTA TGG CGG CAG CCG GAC GCG CTG CTC GCC GAG GTG CTC 11921 
Val Arg Ala Val Trp Arg Gin Pro Asp Ala Leu Leu Ala Glu Val Leu 
3690 3695 3700 



10 CTG CCC GAC CGG GCC TCG ACC GGT GCC GGC CGG TTC GGT GTG CAC CCC 

Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly Arg Phe Gly Val His Pro 
3705 3710 3715 



11969 



15 



GCG CTG CTG GAC GCG GCG CTG CAG CCG TGG ATC GCC GGT GGT CTC CTC 
Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp lie Ala Gly Gly Leu Leu 
3720 3725 3730 



12017 



20 



GAA GTG CCG GAG GAC GCA GTG CTG CTG CCC TTC GCC TGG CAG GGA GTG 12065 
Glu Val Pro Glu Asp Ala Val Leu Leu Pro Phe Ala Trp Gin Gly Val 
3735 3740 3745 3750 

TCG CTC TAC GCG ACG GGT GCC GGT GCT CTG CGG GTG CGG CTG ACG AAG 12113 
Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu Arg Val Arg Leu Thr Lys 
3755 3760 3765 



25 



30 



35 



40 



45 



50 



55 



GCG GGT GAC GGG GCG GTC TCG CTC CAG GCC GCA GAC ACG TCC GGC GCG 12161 
Ala Gly Asp Gly Ala Val Ser Leu Gin Ala Ala Asp Thr Ser Gly Ala 
3770 3775 3780 

GCC GTG CTC TCC TTG GGG GCC CTG GTG ATG CGT CCG CTG GCG CGC CGG 12209 
Ala Val Leu Ser Leu Gly Ala Leu Val Met Arg Pro Leu Ala Arg Arg 
3785 3790 3795 

AAG CTG GAC GTG CTG CTC GGC ACG GAC GCC GGC GAA CGG TCG CTG TAC 12257 
Lys Leu Asp Val Leu Leu Gly Thr Asp Ala Gly Glu Arg Ser Leu Tyr 
3800 3805 3810 

CGC GTC GAG TGG CAG CCG CGG CTC CTG CCC GCC GGC CCG CCG CGC TCC 12305 
Arg Val Glu Trp Gin Pro Arg Leu Leu Pro Ala Gly Pro Pro Arg Ser 
3815 3820 3825 3830 

TGG GCG GTG CTC GGC CCC GAC GCG GAC CGG CTC GCC GGG ACG CCG GGC 12353 
Trp Ala Val Leu Gly Pro Asp Ala Asp Arg Leu Ala Gly Thr Pro Gly 
3835 3840 " 3845 

CTG GGG GAT CAG CCG GAC GGT GGG CCC ACC GCG CTG TAC CCG GAG GTG 12401 
Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr Ala Leu Tyr Pro Glu Val 
3850 3855 3860 

CGG GCG CTG CGG AAG GCG CTG GCG GCG GGC GCG CCG CGG CCG GAA GCG 12449 
Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly Ala Pro Arg Pro Glu Ala 
3865 3870 3875 

GTC GTA CTG CCG GTG CTC TCC GGG GCC GGG GCC ACT CCG GAG TCG GTG 12497 
Val Val Leu Pro Val Leu Ser Gly Ala Gly Ala Thr Pro Glu Ser Val 
3880 3885 3890 

CGG CAG ACA ACG GAG CGC TGT CTG ACC GCG CTC CAG GAC TGG CTG GAC 12545 
Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala Leu Gin Asp Trp Leu Asp 
3895 3900 3905 3910 
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GCC GAG GAG TTG GTG GAC ACA CCG CTC ATA GTG CTC ACC AGG GGA GCC 12593 
Ala Glu Glu Leu Val Asp Thr Pro Leu He Val Leu Thr Arg Gly Ala 
3915 3920 3925 

GTC GCC GCC GTA CCG GGG GAG GAG ATC GGG GAC CTG GCG TGT GCG GGG 12641 
Val Ala Ala Val Pro Gly Glu Glu lie Gly Asp Leu Ala Cys Ala Gly 
3930 3935 3940 

GTG TGG GGC CTG GTG AGG TCC GCG CGG TCC GAG CAC CCG GGC CGC TTC 12689 
Val Trp Gly Leu Val Arg Ser Ala Arg Ser Glu His Pro Gly Arg Phe 
3945 3950 3955 



15 



GCC CTC GTC GAC ACC GAC GGG CAT CCG GAC GAC CGC ACC GCG CTG CCC 
Ala Leu Val Asp Thr Asp Gly His Pro Asp Asp Arg Thr Ala Leu Pro 
3960 3965 3970 



12737 



20 



25 



30 



35 



40 



45 



50 



55 



CTC GCG CTG CGC GCG GTC CTC GAC GGC GCC GGC CAG CTC TCC CTG CGG 12785 
Leu Ala Leu Arg Ala Val Leu Asp Gly Ala Gly Gin Leu Ser Leu Arg 
3975 3980 3985 3990 

GCC GGC ACC GCC CGG ACC CCG GTC CTC CTC CGG GCC GGG ACC CCG GAG 12833 
Ala Gly Thr Ala Arg Thr Pro Val Leu Leu Arg Ala Gly Thr Pro Glu 
3995 4000 4005 

GAG CAG CGG GGT CCG GCA TTC GAC CCG GCG GGC ACG GTC CTG GTG ACG 12881 
Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala Gly Thr Val Leu Val Thr 
4010 4015 4020 

GGC GCG ACC GGC ACG CTC GGG CGG CTG CTG GCC CGG CAT CTG GCC GCC 12929 
Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu Ala Arg His Leu Ala Ala 
4025 4030 4035 

GAG CAC GGT GTG CGC CAT CTG CTG CTG CTG AGC CGC GGC GGC CGG GCT 12977 
Glu His Gly Val Arg His Leu Leu Leu Leu Ser Arg Gly Gly Arg Ala 
4040 4045 4050 

GCC GAA GGC GCG GAC GAA CTC GCC GCG GAA CTG GCC GGG TTG GAA GCC 13025 
Ala Glu Gly Ala Asp Glu Leu Ala Ala Glu Leu Ala Gly Leu Glu Ala 
4055 4060 4065 4070 

GAG CCG TGC TTC GCG GCC TGT GAC GCG GCG GAC CGC GAG GCC CTC GCA 13073 
Glu Pro Cys Phe Ala Ala Cy« Asp Ala Ala Asp Arg Glu Ala Leu Ala 
4075 4080 4085 

CGG GTG CTG GCG GAG GTG CCG GCC GAC CGG CCG CTG ACC GGA GTG ATC 13121 
Arg Val Leu Ala Glu Val Pro Ala Asp Arg Pro Leu Thr Gly Val He 
4090 4095 4100 

CAC GCG GCC GGG GTG CTC GAC GAC GGC ACA CTC GAC GCG CTC ACC CCG 13169 
His Ala Ala Gly Val Leu Asp Asp Gly Thr Leu Asp Ala Leu Thr Pro 
4105 4110 4115 

GAA CGG ATC GGT ACC GTC ATG CGG CCG AAG GCG GAC GCG GCG CTC AAC 13217 
Glu Arg He Gly Thr Val Met Arg Pro Lys Ala Asp Ala Ala Leu Asn 
4120 4125 4130 

CTG CAC GAA CTG ACC CGG ACC AGC CCG CTG TCG GTG TTC GCG GTC TTC 13265 
Leu His Glu Leu Thr Arg Thr Ser Pro Leu Ser Val Phe Ala Val Phe 
4135 4140 4145 4150 
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TCG GGC GCT GCC GGC ATC CTC GGC CGC CCC GGA CAS GCC AAC TAC GCC 

Ser Gly Ala Ala Gly He Leu Gly Arg Pro Gly Gin Ala Asn Tyr Ala 
4155 4160 4165 

GCC GCC AAC ACC TTC CTC GAC GCG CTC GCG CAG CAC CGC CGC GCC CAC 

Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala Gin His Arg Arg Ala His 

4170 4175 4180 
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55 



GGC CTC CCC GCC GTG TCG CTC GCC TCG GGG CTC TCG GGC GGG GCG ACC 13409 
Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Gly Gly Ala Thr 
4185 4190 4195 

GGC ATC ACC GGC CAT CTC TCC GGC ACC GAT CTC CGC CGG ATC CGC AGG 13457 
Gly Met Thr Gly His Leu Ser Gly Thr Asp Leu Arg Arg Met Arg Arg 
4200 4205 4210 

TCC GGT ATC GCG CCG ATC ACC CAC GAC CAG GGG CTC GCC CTC TTC GAC 13505 
Ser Gly He Ala Pro Met Thr His Asp Gin Gly Leu Ala Leu Phe Asp 
4215 4220 4225 4230 

CGA GCG CTC GCC GCC TCG GCC GAG GAC CCG CTC CTC GTA CCG ATC CGG 13553 
Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro Leu Leu Val Pro Met Arg 
4235 4240 4245 

CTC GAC CTC GCC GCC CTC GTC CGG GAG CGG GCC GAG CAC GGG CCG GAC 13601 
Leu Asp Leu Ala Ala Leu Val Arg Glu Arg Ala Glu His Gly Pro Asp 
4250 4255 4260 

GCG GTG CCC GGA CCG CTC CTC GGG CTC CTC CCC GCC CGG GCC GCG GTC 13649 
Ala Val Pro Gly Pro Leu Leu Gly Leu Leu Pro Ala Arg Ala Ala Val 
4265 4270 4275 

CGG CAG GCG GCG GCA CCG GTA CGC GGC GGA GCC CCC GCC CCC GCC GGC 13697 
Arg Gin Ala Ala Ala Pro Val Arg Gly Gly Ala Pro Ala Pro Ala Gly 
4280 4285 4290 

GGC GAG GGG ACG GCC GAG CGG CTC GCC GGG CTC GGG GAG GAG GCC AGG 13745 
Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly Leu Gly Glu Glu Ala Arg 
4295 4300 4305 4310 

CTC CGC GAG CTC GTC AGG CTC GTC CGC GCC GAG GTC TCG GGC GTG CTC 13793 
Leu Arg Glu Leu Val Arg Leu Val Arg Ala Glu Val Ser Gly Val Leu 
4315 4320 4325 

GGC TAC TCG GGC CCG GAC GCG GTC GAG CCC GGG CGC CCG TTC AAG GAT 13841 
Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro Gly Arg Pro Phe Lys Asp 
4330 4335 4340 

CTC CGC TTC GAC TCG CTC ACC GCC GTC GAG CTC CGC AAC CGC CTC GGC 13889 
Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Gly 
4345 4350 4355 

GCC GCC ACC GGG CTC CGG CTC CCG ACC GCG CTC GTC TTC GAC CGC CCG 13937 
Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala Leu Val Phe Asp Arg Pro 
4360 4365 4370 

ACG TCC CAG GCA GTC GCC GAG TAC CTC GCT GCC CAA CTC GCC GGA CCG 13985 
Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala Ala Glu Leu Ala Gly Pro 
4375 4380 4385 4390 
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COG GAC GGC GGC GAC ACC GCG GCC GCC GCG TTC GAG GGC CTG GAG GCG 
Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala Phe Glu Gly Leu Glu Ala 
4395 4400 4405 



14033 



CTG GCC GCG GCG GTG GGC GCG CTG GCC GAG GAC GAT CTC CGG CGC GAC 
Leu Ala Ala Ala Val Gly Ala Leu Ala Glu Asp Asp Leu Arg Arg Asp 
4410 4415 4420 



14081 



10 



GTG CTC CGG CGG CGA CTG ACC GAA CTG GCC GCC GCG CTC ACC CCG CAG 
Val Leu Arg Arg Arg Leu Thr Glu Leu Ala Ala Ala Leu Thr Pro Gin 
4425 4430 4435 



14129 



15 



GGC CGG AAC CCC TCC GCG CCC GCA CCC GCC CCG TCC GAT CTG GAC GAG 
Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala Pro Ser Asp Leu Asp Glu 
4440 4445 4450 



14177 



20 



CGG CTG GAC TCC GCG AAC GAC GAC GAC CTC TTC GCC TTC ATC GAG GAG 14225 
Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu Phe Ala Phe lie Glu Glu 
4455 4460 4465 4470 

CAG CTT TGA GCAGCGAGAA CGACAGCAGC GAGAACGACG GCGACGACAC 14274 
Gin Leu * 



GGCCGGGGCA GCTCCGGGGA CGGCTCCGGG GGCCTCCCCC CGGCAGGACG ACCGGGTCAG 14334 

25 GGAGTATCTG AAGCGG GTG ACC GCC GAA CTG GTC GCC ACC CGC AAG CGG 14383 

Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg 
1 5 10 



30 



CTC GGC GCG CTG GAG GAG CGG GCC CGC GAA CCG ATC GCC GTC GTC GCG 
Leu Gly Ala Leu Glu Glu Arg Ala Arg Glu Pro lie Ala Val Val Ala 
15 20 25 
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GCC GGA GTC ACC GCC GAC CGC GCC CGG GGC AGC CGG ACG GGC GTG TAC 
Ala Gly Val Thr Ala Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr 



14431 



ATG AGC TGC CGC TAC CCG GGC GGG GTG ACG ACC CCC GAG GAC CTG TGG 14479 
Met Ser Cys Arg Tyr Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Trp 
30 35 40 

CGG CTT CTC GCG GAC GAA CGC GAC GCC GTA TCC GGA CTT CCC CGG GAC 14527 
Arg Leu Leu Ala Asp Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp 
45 50 55 

CGC GGC TGG GAC CTG GAC GCC CTC TAC GAC CCC GAC GGC GGC CCC GGC 14575 
Arg Gly Trp Asp Leu Asp Ala Leu Tyr Asp Pro Asp Gly Gly Pro Gly 
60 65 70 75 

ACC AGC TAC GCC CGC GAA GGC GGC TTC CTG AGC CAC TGC GCC GGA TTC 14623 
Thr Ser Tyr Ala Arg Glu Gly Gly Phe Leu Ser His Cys Ala Gly Phe 
80 B5 90 

GAC GCG GAG TTC TTC GGC ATC TCC CCG CGC GAG GCG CTG GCG ATG GAC 14671 
Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp 
95 100 105 

CCG CAG CAG CGG CTG CTG CTG GAG ACC TCC TGG GAG GCC CTG GAA CGC 14719 
Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg 
110 115 * 120 
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125 130 135 

GCG GGC GTC ATG TAC GAC GAC TAC GGC GCC CGG GTG CTG TAC GGC GCC 14815 
Ala Gly Val Met Tyr Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala 
S 140 145 150 ~ 15 5 

GGC GCC GGC CCG CCC GAC GAC CTG GAG GGT TAT CTC GTC AAC GGC AGC 14863 
Gly Ala Gly Pro Pro Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser 
160 165 170 

GCG GGC AGC ATC GCC TCC GGC CGT GTC TCC TAC ACG TTC GGG CTG CGC 14911 
Ala Gly Ser He Ala Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg 
175 180 185 

GGC CCC GCG GTC ACC GTC AAT ACG GCC TGT TCG TCG TCA CTG GTG TCG 14959 
Gly Pro Ala Val Thr Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser 
190 195 200 



is 



CTC CAT CTG GCG GTG CGT GCC CTG CGG AAC GGC GAG TGC GAC ATG GCA 15007 
Leu His Leu Ala Val Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala 
so 205 210 215 

CTG GCC GGC GGG GCG ACG GTG CTG TCC ACC CCC ACC GTG CTC GTG GAC 15055 
Leu Ala Gly Gly Ala Thr. Val Leu Ser Thr Pro Thr Val Leu Val Asp 
220 225 230 235 

25 TTC TCC CGC CAG CGC GGT CTG GCC CCC GAC GGC CGC TGC AAG GCG TTC 15103 

Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe 
240 245 250 

GCC GAC TCC GCC GAC GGC ACC TCC TCG GCC GAG GGC GCC GGA ATG CTG 15151 
30 Ala Asp Ser Ala Asp Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu 

255 260 265 

CTG CTC CAG CGG CTG TCC GAC GCC CGC CGC GAG GGG CGC CCC GTG CTG 15199 
Leu Leu Gin Arg Leu Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu 
35 270 275 280 

GCC GTC ATT CGC GGC TCG GCC GTC AAC CAG GAC GGC GCC AGC AAC GGA 15247 
Ala Val He Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
285 290 295 

40 CTG ACC GCC CCC AAC GGG CGG GCG CAG CGG CAG GTC ATC GAG GAC GCG 15295 

Leu Thr Ala Pro Asn Gly Arg Ala Gin Arg Gin Val He Glu Asp Ala 
300 305 310 315 

CTG CCC GAC GCC GGG GTC GGC CCC GAC CAG GTG GAC GCG GTC GAG GCG 15343 
45 Leu **g Ala Gly Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala 

320 325 330 

CAT CGC ACC GGT ACC GAG CTG GGC GAC CCC ATC GAG GCC GGG GCG CTC 15391 
His Gly Thr Gly Thr Glu Leu Gly Asp Pro He Glu Ala Gly Ala Leu 
335 340 345 

CTC GCC ACC TAT GGA ACG GCC CGT ACG GCG GAG CGC CCG CTG TGG CTC 15439 
Leu Ala Thr Tyr Gly Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu 
350 355 360 



50 



55 GGC TCC CTG AAG TCC AAC ATC GGG CAC ACC CAG GCC GCC GCC GGT GTT 

Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val 



15487 



28 



10 



15 



20 



25 



EP 0 791 655 A2 

365 370 375 

GCG GGC GTC ATC AAG ATG GTG CTG GCG ATG CGG CAC GGC CGG CTG CCC 15535 
Ala Cly Val He Lys Met Val Leu Ala Met Arg His Gly Arg Leu Pro 
380 385 390 395 

CGC ACC CTG CAC GTG GAC CGG CCC ACC ACC CGG GTG GAC TGG GAG AAG 15583 
Arg Thr Leu His Val Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys 
400 405 410 

GGC GGG GTG CGG CTC CTC ACG GAG CCG GTG CCA TGG CCG GGG GAA GCG 15631 
Gly Gly Val Arg Leu Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala 
415 420 425 

GGG GAG CCG CGT CGC GCG GGC GTG TCC TCC TTC GGC GCG AGC GGC ACG 15679 
Gly Glu Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr 
430 435 440 

AAC GCG CAT GTG GTG CTG GAG AGC GTC CCG GCC GGT GAA CCG CCC GCC 15727 
Asn Ala His Val Val Leu Glu Ser Val Pro Ala Gly Glu Pro Pro Ala 
445 450 455 

GCC GGG CGG CCG GAG GAC ACA GGC GGC GCC TGG ACG GTC AGC GGC CGC 15775 
Ala Gly Arg Pro Glu Asp Thr Gly Gly Ala Trp Thr Val Ser Gly Arg 
460 465 470 475 

GGC CCG GCG GCC CTG CGC GCC CAG GCC GCC CGG CTG TAC GAC GCG CTC 15823 
Gly Pro Ala Ala Leu Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu 
480 485 490 

ACC GGC ACC GGC ACC GGC ACC GGA CAG GGC GCC GGA CAG GGC GCC GGA 15871 
Thr Gly Thr Gly Thr Gly Thr Gly Gin Gly Ala Gly Gin Gly Ala Gly 
495 500 505 

CCC GGC ACC GCC GAG GTG GCC GGC GCG CTG GCC CAC GCC CGT ACC CCG 15919 
Pro Gly Thr Ala Glu Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala 
510 515 520 

TTC CGG CAC CGG GCC GTC GTG CTC GGC GGA AAC CGC GCC GAA CTC CTC 15967 
Phe Arg His Arg Ala Val Val Leu Gly Gly Asn Arg Ala Glu Leu Leu 
525 530 535 

40 GCG GGG CTG CGC GAG CTG GCG GAG GAG GAG CAT CCC GGA CCC CGC GTG 16015 

Ala Gly Leu Arg Glu Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val 
540 545 550 555 

GTC ACA GGG ACC GCC CCG GCC ACC GAG CGG CGG ACG GCC TTC CTC TTC 16063 
Val Thr Gly Thr Ala Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu Phe 
560 565 570 

TCC GGG CAG GGC AGC CAG CGG GCC GGC TCC GGC CGG GGC CTG TAC CGC 16111 

Ser Gly Gin Gly Ser Gin Arg Ala Gly Ser Gly Arg Gly Leu Tyr Arg 
575 580 585 

50 

CGC CAC CCG GTC TTC GCC CGC GCC CTC GAC GAG GTG TGC GCC GCG CTC 16159 

Arg His Pro Val Phe Ala Arg Ala Leu Asp Glu Val Cys Ala Ala Leu 
590 595 600 

55 GAA CCG CAT CTT CAC CGC CCC CTG CGT GAC CTG ATG TTC GCC GAG CCC 16207 

Glu Pro His Leu His Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro 
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605 610 615 



GGC AGC CCG GAA GCG GAG CCG CTC GAC CGC ACC GAG TTC ACC CAG CCC 16255 
Gly Ser Pro Glu Ala Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro 
620 625 630 635 

GCG CTG TTC GCG CTC CAG ACC GCC CTG TTC CGG CTG GCC GAG CAC CAC 16303 
Ala Leu Phe Ala Leu Gin Thr Ala Leu Phe Arg Leu Ala Glu His His 
640 645 650 

GGG CTG CGC GCC GAG GCG CTG TGC GGG CAC AGC GTC GGC GAG ATC GCG 16351 
Gly Leu Arg Ala Glu Ala Leu Cys Gly His Ser Val Gly Glu He Ala 
655 660 665 

GCC GCC CAT GCC GCC GGT GTG CTG ACC CTG CCC GAC GCG GCC CGT CTG 16399 
Ala Ala His Ala Ala Gly Val Leu Thr Leu Pro Asp Ala Ala Arg Leu 
670 675 680 

GTG GCC GCC CGG GGA CGG CTG ATG CAG GCC CTG CCG GCC GGC GGT GCC 16447 
Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala 
685 690 695 

ATG GCC GCG CTG CGC GCC ACC GCC GAG GAG ATC GCA CCA CTC CTG GAG 16495 
Met Ala Ala Leu Arg Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu 
• 70 ° 705 710 715 

CGC CGC GCG GGC GAA CTG GCG CTC GCC GCC GTC AAC GGC CCC TCC TCG 16543 
Arg Arg Ala Gly Glu Leu Ala Leu Ala Ala Val Asn Gly Pro Ser Ser 
720 725 730 

GTC GTG GTG TCG GGC GAC GAG GCC GCC GTC CTG GAG CTA CTG GAG CAG 16591 
30 Val Val Val Ser Gly Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin 

735 740 745 

TCG CGG GCC GAG GGC CGC GAG GCC AGG CGG CTC GCC GTC AGC CAT GCC 16639 
Trp Arg Ala Glu Gly Arg Glu Ala Arg Arg Leu Ala Val Ser His Ala 
750 755 760 



TTC CAT TCA CCG CGG ATG GAC GGA ATG TTG ACA CAG TTC GAC CGG GTC 16687 
Phe His Ser Pro Arg Met Asp Gly Met Leu Thr Gin Phe Asp Arg Val 
765 770 775 

GCT CGC ACC CTC ACG TTC GCT CCG CCG ACC ATT CCC CTC GTC TCC ACC 16735 
Ala Arg Thr Leu Thr Phe Ala Pro Pro Thr He Pro Leu Val Ser Thr 
780 785 790 795 

CTC ACC GGT ACG CCC GTC ACC GAG GAA ACC CTG TGC ACC GCG GAC CAC 16783 
45 Leu Thr Gly Thr Pro Val Thr Glu Glu Thr Leu Cys Thr Ala Asp His 

800 80S 810 

TCG GTC CGC CAG GCG CGC GAG CCG GTC CGC TTC CTG GAC GCC ATC CGG 16831 
Trp Val Arg Gin Ala Arg Glu Pro Val Arg Phe Leu Asp Ala Met Arc 
815 82 o 825 

ACC CTG CGC GCC GAC GGG ATC GAC ACC TTC GTG GAA CTC GGC CCC GAC 16879 
Thr Leu Arg Ala Asp Gly He Asp Thr Phe Val Glu Leu Gly Pro Asp 
830 835 840 

55 GGC GTC CTC TCC GCC ATG GCC CGC GAC TCC GCG GAC GAC CGG CCC GAT 

Gly Val Leu Ser Ala Met Ala Arg Asp Cys Ala Asp Asp Arg Pro Asp 
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845 850 855 

GGC GAC ACA ACC GGG GCC GGG GAC GGG GAG ACC CCC GAT CCG CTG CTC 16975 
Gly Asp Thr Thr Gly Ala Gly Asp Gly Glu Thr Pro Asp Pro Leu Leu 
860 865 870 875 

ACC CTC CCG CTG CTG CGC CGC TCC GTG CCC GAG ACC GGC GAC GCC GAA 17023 
Thr Leu Pro Leu Leu Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu 
880 885 890 

CAC CCC GGC GGC TTC GAA CGG GCC CTG GCC ACC GCC TAC GCA CAC GGC 17071 
His Pro Gly Gly Phe Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly 
895 900 905 

GTC CCC CTG CGG CTC GCG CCC GCC CCC GAC GCC GCG TCC CTC GCC GTG 17119 
15 Val Pro Leu Arg Leu Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val 

910 915 920 

GCC GCC GAA CTG CCC ACC TAC GCC TTC CAG CGC ACC CAC TAC TGG CTC 17167 
Ala Ala Glu Leu Pro Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu 
925 930 935 

20 

GAC GCG CCC GCC GCC CCC GCC GCC CTC CCC GCC GGG CTC GAC GAC GCC 17215 
Asp Ala Pro Ala Ala Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala 
940 945 950 955 

25 GGT CAC CCG CTG CTC ^CC GCG GCC CTC GAC CTG CCC GGC GGA CGC GGA 17263 

Gly His Pro Leu Leu Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly 
960 965 970 



30 



ACG GTG TGG ACC GGA GCG CTC TCC GCG GCC ACC CTG CCC TGG GCC GCG 17311 
Thr Val Trp Thr Gly Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala 
975 980 985 



GAC CAC AGC GTG CAC GGC CGC ACC GTC CTG CCG GGC ACC GCC CTG CTC 17359 

Asp His Ser Val His Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu 
990 995 1000 

35 

GAC CTG GCG CTC CAC GCC GCC CCG CGC GTC GGC GAG TTG ACC TTC GAG 17407 

Asp Leu Ala Leu His Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu 

1005 1010 1015 

40 GCG CCG CTG GTG CTG CCG GAG GAC GGA GAG GTC CGG CTG CGC GTC GTA 17455 

Ala Pro Leu Val Leu Pro Glu Asp Gly Glu Val Arg Leu Arg Val Val 
1020 1025 1030 1035 

CTC GCT GAA CCG GAC GCG AGC GGA GTA CGC GAA CTG TCT GTC CAC TCC 17503 
Leu Ala Glu Pro Asp Ala Ser Gly Val Arg Glu Leu Ser Val His Ser 
45 1040 1045 1050 

GCC GGC GAG GAC GGC GGC TGG ACC CGG CAC GCG ACA GCG GTC CTG GAC 17551 
Ala Gly Glu Asp Gly Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp 
1055 1060 1065 

50 

ACC GGC ACC ACC ACC GCC GGG GAG CCC GCC GGC GCA CCG CCC GCC GCA 17599 
Thr Gly Thr Thr Thr Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala 
1070 1075 1080 

55 TGG CCG CCC GGG GAC GCC GAA CCC CTC GAC CTT GCC GCC GAG TAC GAG 17647 

Trp Pro Pro Gly Asp Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu 
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1085 



1090 



1095 



10 



15 



CGC TTC GCC GAC GCC GGC ATC GGA TAC GGC CCC GCC TTC CGC GGA CTG 17695 
Arg Phe Ala Asp Ala Gly lie Gly Tyr Gly Pro Ala Phe Arg Gly Leu 
1100 1105 1110 ~ 1H5 

CGC TCC GCC TGG CGC GAC GGC GAC GCG ATA CTG GCC GAC GTA CGG CTG 17743 
Arg Ser Ala Trp Arg Asp Gly Asp Ala He Leu Ala Asp Val Arg Leu 
1120 1125 H30 

CCC GGC GAA CTG GCC GGC GAA GCC GAC CGG TAC GGC ATC CAC CCG GCC 17791 
Pro Gly Glu Leu Ala Gly Glu Ala Asp Arg Tyr Gly He His Pro Ala 
1135 1140 H45 

CTG CTC GAC GCC GCC CTG CAC ACC GCG GCC GCC GCC CTG GGA GGG GCG 17839 
Leu Leu Asp Ala Ala Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala 
1150 1155 H60 



20 



CAC GGG ATG CTG CCG TTC ACG TGG AAC GGC GTC ACC CTC CAC GCC CGC 
His Gly Met Leu Pro Phe Thr Trp Asn Gly Val Thr Leu His Ala Arg 
1165 H70 1175 
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GGA GCG CAC GCC ATC CGG GTG CGG CTG ACC CCG GCC GGC CCC GAC GCG 17935 
Gly Ala His Ala He Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala 
H80 1185 1190 H95 

GTC GCG GTC ACC GCC GTG GAC CCG GCG GGG CGC CCC GTG TTC ACG GCC 17983 
Val Ala Val Thr Ala Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala 
1200 1205 1210 

GCC TCG CTC ACC CTG CGA CCG GTC ACG ACC GGG CAG CTG ACC GCG GCC 18031 
Ala Ser Leu Thr Leu Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala 
1215 1220 1225 

GAG GCC GCG CGG GCC CCG CTG TAC CGG GTG CGC TGG ACC GGC CTC CCG 18079 
Glu Ala Ala Arg Ala Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro 
1230 1235 1240 

GAC ACC GGA ACC GCC CGG GAC CAC ACC TGG GCG GTG GCC GGC GGC CCG 18127 
Asp Thr Gly Thr Ala Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro 
1245 1250 1255 

GGC GAC CTG TTA CCC GGG GAG ACC CCG CAC CAC CCC GAC CTC GCC TCC 18175 
Gly Asp Leu Leu Pro Gly Glu Thr Pro His His Pro Asp Leu Ala Ser 
1260 1265 1270 1275 

GCG CTC GCC GAC ACC GGC ACC GCC CCC TTC CGC GTA CTC GCG GAT CTC 18223 
Ala Leu Ala Asp Thr Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu 
1280 1285 1290 

CGC GGA TAC GGC ACC GCC ACC CCC CGG GAA CTC GCC TCC CAG GCG CTC 18271 
Arg Gly Tyr Gly Thr Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu 
1295 1300 1305 

GCC CTC GTC CAG CAG TGG GCC GAC GCG GCC GAG GCC GCC GAA GGC AGG 18319 
Ala Leu Val Gin Gin Trp Ala Asp Ala Ala Glu Ala Ala Glu Gly Arg 
1310 1315 1320 

CTC GTC CTG GTG ACA CGC CGG GCC GTC GAC ATC GGT GAC GGC GTC ACG 18367 
Leu Val Leu Val Thr Arg Arg Ala Val Asp He Gly Asp Gly Val Thr 
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1325 



1330 



1335 



GAC CCG GCG GCG GCG ACC GTG TGG GGA CTG GTC CGG GCG GCA CAG TCC 
Asp Pro Ala Ala Ala Thr Val Trp Gly Leu Val Arg Ala Ala Gin Ser 
1340 1345 1350 1355 



18415 



10 



GAG CAC CCC GGG TGC TTC GCG CTC CTC GAC ACC GAC GAC TCC CCC CGC 18463 
Glu His Pro Gly Cys Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg 
1360 1365 1370 

TCC CGG CAA CTC CTG CCA CGC GTC GCG GGC ACC GCC GAG CAG CTC GCA 18511 
Ser Arg Gin Leu Leu Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala 
1375 1380 1385 



CTC CGC GAC GGC ACC CTG CTC GCC CCC TCC CTC ACC CGT GCC ACG CTG 
15 Leu Arg Asp Gly Thr Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu 

1390 1395 1400 
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CCC GCC GGC GCC CGG CTG CCC GCA CTC GAC GGC ACC GTC CTG ATC ACT 18607 
Pro Ala Gly Ala Arg Leu Pro Ala Leu Asp Gly Thr Val Leu lie Thr 
1405 1410 1415 

GGG GGC ACC GGC AGC CTC GGC GCG GAG GCG GCC CGC CAT CTG GTC ACC 18655 
Gly Gly Thr Gly Ser Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr 
1420 1425 1430 1435 



25 CGG CAC GGT GCC CGG CGC CTG CTC CTG ACC AGC CGA AGC GGC CCG CAG 

Arg His Gly Ala Arg Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin 
1440 1445 1450 

GCC CCC GGC GCG GCC GAA CTC GTC GCC GAA CTG GCC GCC TTG GGC GCC 
30 Ala Pro Gly Ala Ala Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala 

1455 1460 1465 



18703 



18751 



CAC GCG GAC GTG GCC GCC TGC GAC GTC GCC GAC CGC GCC GCC CTG CGG 18799 
His Ala Asp Val Ala Ala Cys Asp Val Ala Asp Arg Ala Ala Leu Arg 
1470 1475 1480 

GCC CTG CTC GAC CGC GTA CCC GCC GGC CAC CCG CTG ACC GCG GTC CTG 18847 
Ala Leu Leu Asp Arg Val Pro Ala Gly His Pro Leu Thr Ala Val Leu 
1485 1490 1495 

40 CAC ACG GCG GGC GTC CTG GAC GAC GGC GTC CTC ACC GCC CAG ACA CCG 18895 

His Thr Ala Gly Val Leu Asp Asp Gly Val Leu Thr Ala Gin Thr Pro 
1500 1505 1510 1515 

CAG CGG CTC GCG CCC GTC CTC CCC CCG AAG GCC GAC GCC GTA CGC AAT 18943 
Gin Arg Leu Ala Ala Val Leu Arg Pro Lys Ala Asp Ala Val Arg Asn 
1520 1525 1530 

CTG CAC GAA CTC ACC CAG GGG CAC GCC CTG TCG GCG TTC ATC CTC TAC 18991 
Leu His Glu Leu Thr Gin Gly His Ala Leu Ser Ala Phe He Leu Tyr 
1535 1540 1545 

TCG TCG GCC GCC GGA GTG CTC GGC AGC GCG GGC CAG AGC GGC TAC GCC 19039 
Ser Ser Ala Ala Gly Val Leu Gly Ser Ala Gly Gin Ser Gly Tyr Ala 
1550 1555 1560 

55 GCC GCC AAC GCC TAC CTG GAC TCC TTC GCC GTC TGG CGG CGG AGC CGG 19087 

Ala Ala Asn Ala Tyr Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg 
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1565 1570 1575 

GGA CTG CCC GCC GTA TCG CTC GGC TCG GGC CCG TCG GAC GGC GGC GGC 
Gly Leu Pro Ala Val Ser Leu Gly Trp Gly Pro Trp Asp Gly Gly Gly 
1580 1585 1590 1595 

ATG GCG AGC GGA CTC GGC GGA ACC GAC ACG GCC CGG CTG CGG CGC AGC 
Met Ala Ser Gly Leu Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser 
1600 1605 1610 

GGC ATC GCA CCC CTC AGC CGC GCC GAG GGC CTG GCC GCG CTC GAC GCG 
Gly He Ala Pro Leu Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala 
1615 1620 1625 

GCG CTC GCG GCC GGC GGC GAC GAC ACC GCG CCG GCC CAC CTG CTG CCG 
Ala Leu Ala Ala Gly Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro 
1630 1635 1640 

ATC CGC GTC GAC GCG GTG ACC CTG CGC GGC GCC GAC ACC GTC CCC GCC 
He Arg Val Asp Ala Val Thr Leu Arg Gly Ala Asp Thr Val Pro Ala 
1645 1650 1655 

GTG CTG CGC GAC CTG GCG GGA ACC GCG CCA AGC GCC GCC GAA CGG CCC 
Val Leu Arg Asp Leu Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro 
1660 1665 1670 1675 

CCC GGG ACA CCG GAG GAC ACG AAC GCG CCC CTG GCG GAC GTC ACC CAA 
Pro Gly Thr Pro Glu Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin 
1680 1685 1690 

CTG CAC GGC CGG GAA CGG AAG GAG GCA CTG ACC GGC TTC GTA CGC GCC 
Leu His Gly Arg Glu Arg Lys Glu Ala Leu Thr Gly Phe Val Arg Ala 
1695 1700 1705 

CAG GTG GCC GCG GTG CTC GGC CAC CCC ACG TCC GAC ACG ATC GAC GTC 
Gin Val Ala Ala Val Leu Gly His Pro Thr Ser Asp Thr He Asp Val 
1710 1715 1720 

CGC CGG AGC TTC AAG GAA GCG GGC TTC GAC TCC. CTC ACC GCC GTC GAA 
Arg Arg Ser Phe Lys Glu Ala Gly Phe Asp Ser Leu Thr Ala Val Glu 
1725 1730 1735 

CTG CGC AAC CGG CTG CGC GCC GCC ACC GGG CTC AAG CTC CCC GCC ACG 
Leu Arg Asn Arg Leu Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Thr 
l" 7 * 0 1745 1750 1755 

CTC GTG TTC GAC CAC CCG ACC CCC CTC GCG CTC GCC GGC TTC CTC CAC 
Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His 
1760 1765 1770 

CGC GAA CTC CCC GGC GCC GAA GCC TCC CTG ATG AGC GCG ATC GAC ACT 
Arg Glu Leu Pro Gly Ala Glu Ala Ser Leu Met Ser Ala He Asp Thr 
1775 1780 1785 

CTC CGG CAC CGG CTC CGC GAC GCC CTG GCC GAC GAC GCC GCA GAC GAC 
Leu Arg His Arg Leu Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Asp 
1790 1795 1800 

GCC CTG CGC GAC CAG ATC ACC CGA CGA CTC GAG ACC CTC CTC GCC GGC 
Ala Leu Arg Asp Gin He Thr Arg Arg Leu Glu Thr Leu Leu Ala Gly 
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1805 



1810 



1815 



ATA GCC CGG ACC GAG GAG CCC GCG CCC GCC ACC GCC GCC GCC GAC GAC 
lie Ala Arg Thr Glu Glu Pro Ala Pro Ala Thr Ala Ala Ala Asp Asp 
1820 1825 1830 1835 



19855 



10 



GGG AGC GGA GCC GGT GAT GTC GCG GAA CGA CTG AGC ACC GCG TCG GAC 19903 
Gly Ser Gly Ala Gly Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp 
1840 1845 1850 

GAC GAA CTG TTC GAA CTG CTC GAC AGC GGT TTC AC A CCC TGA CCGGCCGGCG 19955 
Asp Glu Leu Phe Glu Leu Leu Asp Ser Gly Phe Thr Pro * 

1855 1860 1865 



15 



GACCGGCACC ACGACATCGA GGCGACCGCA CCGAACACCC CCAGGGGGAA AGCT 

GTG TCC ACC GAG AAC TCC ACC AAC GTC CCG GCG AGC GAG 
Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu 
15 10 



20009 
20048 



20 



GAC AAG CTC CGC GCC TAT CTG CGT CGC GCG ATG GCC GAC CTC CAC GAG 
Asp Lys Leu Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu 
15 20 25 



20096 



25 



TCC CGC GAA CGG CTC CGC GCC ACG GAA GCG CGC GCC CAG GAG CCG ATC 
Ser Arg Glu Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro lie 
30 35 40 45 



20144 



30 



GCG GTG GTG GGT ATG GGG TGC CGG TTC CCC GGT GGG GTG GGT TCG CCG 20192 
Ala Val Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro 
50 55 60 

GAG GCG TTG TGG CGG TTG GTG GTG GAG GGG GTG GAC GCG GTT TCC CCG 20240 
Glu Ala Leu Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro 
65 70 75 



35 



TTT CCC GGT GAT CGT GGC TGG GAT GTG GAG GGG TTG TAC GAC CCG GAG 
Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu 
80 85 90 



20288 



40 



45 



CCG GGT GTG GCG GGG AAG TCG TAT GTG CGG GAG GGG GGT TTT CTG CAT 2033 6 

Pro Gly Val Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His 
95 100 105 

GAT GCG GCG GAG TTC GAT GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG 20384 
Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly lie Ser Pro Arg Glu 
110 115 120 ~ 125 

GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TGG 20432 
Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
130 135 140 



GAG GCG ATC GAG CGG GCG GGT ATC GAC CCG CAT TCG CTG CAC GGC AGC 
so Glu Ala lie Glu Arg Ala Gly lie Asp Pro His Ser Leu His Gly Ser 

145 150 155 



20480 



55 



CGC ACC GGC GTC TAC GCC GGC GTG ATG TAC CAC GAC TAT GGC ACG GGA 
Arg Thr Gly Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Thr Gly 
160 165 170 



20528 



35 



10 
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CAG ACC TCC GCG ACC GAC ACG AGC GGT TAT TCC GGC ACC GGT ACG TCG 20576 
Gin Thr Ser Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser 
175 180 185 

GGG AGT GTG GTG TCG GGG CGT GTO GCC TAC ACG CTG GGG CTG GAG GGT 20624 
Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly 
1*° 195 200 205 

CCG GCC GTG ACC GTG GAT ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG 20572 
Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
210 215 220 

CAT CTG GCG GTG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATG GCG TTG 20720 
His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu 
225 230 235 

GCC GGT GGT GTG ACG GTG ATG GCC GGG CCG GGG ATG TTC GTG GAG TTT 20768 
Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe 
240 245 250 

20 TCG CGG CAG CGG GGG TTG GCG GCC GAT GGG CGG TGC AAG GCG TTC GCG 20816 

Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala 
255 260 265 

GAT GGG GCG GAT GGG ACC GCT TGG GCC GAG GGT GCG GGG GTG GTG CTG 20864 
Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu 
25 270 275 280 285 

GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG CAT CCG GTG TTG GCG 20912 
Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala 
290 295 300 

30 

GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT GCG TCG AAT GGT TTG 20960 
Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 
305 310 315 

^ ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG ATT CGT CAG GCG TTG 21008 

Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu 
320 325 330 

GCG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT GTG GTG GAG GCG CAT 21056 
Ala Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His 
40 3 3 5 3 4 0 345 

GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG GCG CAG GCG TTG CTG 21104 
Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu Leu 
350 355 360 365 



45 



50 



GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG GTG TGG TTG GGG 21152 
Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly 
370 375 380 

TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTG GCT 21200 
Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala 
385 390 395 



GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT GGG TCG TTC CCG CGG 21248 
Gly Val He Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg 
55 400 405 410 
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ACG TTG CAT GTG GAT GAG CCG AGC CGG CAT GTG GAC TGG TCG GCT GGT 
Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly 
415 420 425 

GGT GTG TGG TTG CTG ACC GAG GCG CGG GAG TGG CCG GGG GTG GAC CGG 
Gly Val Trp Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg 
430 435 440 445 

CCG CGT CGG GCG GCG GTC TCC GCC TTT GGT GTC AGT GGT ACC AAC GCC 
Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala 
450 455 460 

CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG GCG GAG AGC GCC ACG 
His Leu lie Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala Thr 
465 470 475 

ACC CCG GTC CGC TCT GAG GTG TCG GAG TCT GCT GCG GTC TTC GAT GCC 
Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala Ala Val Phe Asp Ala 
480 485 490 

CGC AGT GGT GTG GTG CCG GTG GTG GTT TCG GGG CGT TCG CGG GTG GTG 
Arg Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val 
495 500 505 

GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC GGT GGT 
Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly 
510 515 520 525 

GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GGC CGG TCG CGG TTT 
Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe 
530 535 540 

GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT GCC GGG 
Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly 
545 550 555 

CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG GTC ACG 
Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr 
560 565 570 

GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT GGT GGG GGG GTG GTG 
Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val 
575 580 ' 585 

TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG GGG 
Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly 
590 595 600 605 

CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT GCG 
Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala 
610 615 620 

CGG GCG CTG AGT GTT CAT GTG GGG TGG GAT TTG CTG GAG GTG GTG TCG 
Arg Ala Leu Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser 
625 630 635 

GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG CAG CCG GTG ACG TGG 
Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp 
640 645 650 
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10 



GCG GTG ATG GTG TCG CTG GCC CGG TACTGG CAG GCG ATC GGT GTG GAC 22016 
Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp 
655 660 665 

GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG ATC GCT GCT GCC ACG 22064 
Val Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr 
670 675 680 68S 

GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT CTG 22112 
Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu 
690 695 700 

CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG GCG 22160 
Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala 
705 710 715 

GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG TGG 22208 
Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp 
720 725 - 730 

20 CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTG GTT 22256 

Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val 
735 740 745 

TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTG GCC GTC TGT CAG GCG 22304 

2S Gly Asp Arg Ala Val Ala G1 y Ala Val Cys Gin Ala 

750 755 760 * 765 

GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC TCC 22352 
Glu Gly Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser 
770 775 * 780 



/5 



30 



CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC GGT 22400 
Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly 
785 790 ~ 795 

ATC CGC CCG CGC ACT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC GGA 22448 
He Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly 
800 805 810 



45 



SO 



Oft nit lit ^ G GT T 17° GAT 00(3 GGG TAT TGG TTC CGT AAT 22496 

. „, , /> t « , ~ -- - Tyr 

825 



Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn 

40 815 820 



CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG GAG 22544 
Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu 

835 840 845 

GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTT 22592 
Glu Gly Has Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val 
850 855 860 

ml T£ t7 °f G ? C GAG 000 000 GAC CGG AGT GTC G AT GCC 22640 

His Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala 
865 870 8 75 



tu° ^ ACC CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTG ACC 
Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr 
55 880 885 890 



22688 
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TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TGG GAC CCC 
Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro 
895 900 905 



22736 



GCC CTG CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC 
Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn 
910 915 920 925 



22784 



10 



CAC CAC CAC TAC TGG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG ACC 
His His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr 
930 935 940 



22832 



15 



ACC CAG AGC CCC ACC GAT GCC CAG AAC CCC GCC GAC GCC CTT CCC TAC 22880 
Thr Gin Ser Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr 
945 950 955 

AAG GTG AGT TGG AAG CGG TTG CGG GAC CAG GAC AGC TTG ACC GCG CGC 22928 
Lys Val Ser Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arg 
960 965 970 



20 



CTC GAC GGC CGA TGG CTG CTG GTG GTA CCG GAG GCG TCG GCG GAC CCG 
Leu Asp Gly Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro 
975 980 985 



22976 



25 



30 



35 



40 



45 



50 



TCG GTT GCT GAG GGC GTC GCG CGC GAG CTG ACC GCG CGG GGC GCG ACC 
Ser Val Ala Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala Thr 
990 995 1000 1005 



23024 



55 



GTG GAG TCG CTG ACG GTC GAG CCG GGC GCC GAC CGT TCG CGG CTG CGC 23072 
Val Glu Ser Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg 
1010 1015 1020 

GGG CTG CTG GTC GAC GCC ACG GAA CGG GAC GAA GCC GGG CCG CTG CGC 23120 
Gly Leu Leu Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro Leu Arg 
1025 1030 1035 

GGG ATC GTC TCG CTG CTG GCG TTG GCC GGG GAC CAC GCC GGG GCC GAC 23168 
Gly He Val Ser Leu Leu Ala Leu Ala Gly Asp His Ala Gly Ala Asp 
1040 1045 1050 

GGG GCA CGC CCG GTG GTT CCG GCC GGC CTG GCA GCG TCA CTG GCG CTG 23216 
Gly Ala Arg Pro Val Val Pro Ala Gly Leu Ala Ala Ser Leu Ala Leu 
1055 1060 1065 

ATC CAG GCC GCG GGC GAC GCG GGC ACG GAG GCC GGG CTG TGG GCG GTG 23264 
He Gin Ala Ala Gly Asp Ala Gly Thr Glu Ala Gly Leu Trp Ala Val 
1070 1075 1080 1085 

ACC CGC GGC GCC GTC GCG GCC GTG CCC GGT GAC GTA CCG GCG CCG TCG 23312 
Thr Arg Gly Ala Val Ala Ala Val Pro Gly Asp Val Pro Ala Pro Ser 
1090 1095 1100 

CAG GCG CTG CTC TGG GGA TTC GGC CGG GTG GCC GGG ATC GAG CTG CCG 23360 
Gin Ala Leu Leu Trp Gly Phe Gly Arg Val Ala Gly He Glu Leu Pro 
1105 1110 1H5 

CAC TGC TGG GGC GGC CTG CTC GAC CTG CCG ACC GGG CCC GGC GAC TCC 23408 
His Cys Trp Gly Gly Leu Leu Asp Leu Pro Thr Gly Pro Gly Asp Ser 
1120 1125 H30 
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GGG TTC CGG CAA CTG GCC GCG ACC CTG GCG GGC CGT CCC GCG GAG GAC 
Gly Phe Arg Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp 
1135 1140 1145 



23456 



CAG GTG GCG CTG CGG GCT TCG GGC GCG TAC GGC CGC AGG CTG GTC CGG 
Gin Val Ala Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg 
1150 1155 1160 1165 



23504 



GCC TCC GCG GCG GGC GGC GCG GAC GGC TGG CGG CCG CGG GGA ACG GTG 
10 Ala Ser Ala Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val 

1170 1175 1180 



23552 



15 



CTG GTG GTG GGC GAC ACC GCG GAG GTC GCC GGA CCG CTG GTG CGC TGG 23 600 

Leu Val Val Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp 
1185 1190 1195 

CTG CTC GGC AAC GGC GCA CGG CGG GTG ACC CTG TCC GGA CTC TCC GGC 23648 
Leu Leu Gly Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly 
1200 1205 1210 



2o CCG CTG CCG GAG GAA CTC GCC GAT GTG GCG GCA CGG GTG ACC GTG GCG 

Pro Leu Pro Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala 
1215 1220 1225 



25 



30 



35 



40 



45 



50 



23696 



55 



CCC TGT GAT CCG GCC GAT CGC CCC GCC CTG CGG ACG CTG CTC GCC GAA 23744 
Pro Cys Asp Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu 
1230 1235 1240 1245 

CAG GCG CCG ACC GCC GTG CTC GTG GCG CCC CCG GCC GTC CCG CCC ACG 23792 
Gin Ala Pro Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr 
1250 1255 1260 

CCG CTC GCG GAG ATG ACC GCC GAG GCG TTA GCC ATC GCG CTG TCC GCG 23840 
Pro Leu Ala Glu Met Thr Ala Glu Ala Leu Ala He Ala Leu Ser Ala 
1265 1270 1275 

AAG ACC GGT CTG GTC GAC CGT CTG GAC TCG CTG CTC GAC GAG CCG GAC 23888 
Lys Thr Gly Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp 
1280 1285 1290 

CCC CTG CTC GAG GAC GGG GAA CTC GAC GCG TTC GTC GTC TTC TCC TCC 2393 6 

Pro Leu Leu Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser 
1295 1300 1305 

GTG GCA GGG GTG TGG GGC GGC GCG GGA CAG GGT GGT TAC GCG GCC GGT 23984 
Val Ala Gly Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly 
1310 1315 1320 1325 

ACC GCG TAC CTC GAC GCG CTC GCC GAA TGC CGG CGG GCC GGG GGG CTG 24032 
Thr Ala Tyr Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu 
1330 1335 " 1340 

CCG GTC ACC TCG GTG GCG TGG ACG CCG TGG CTC GGT ACG CCG GCG GCG 24080 
Pro Val Thr Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala 
1345 1350 1355 

GAC TCC CTG GGC GAG CAG ATG AGC CGA GCT GGC ATC ACC CCC CTG GAT 24128 
Asp Ser Leu Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp 
1360 1365 1370 
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CCG GCG GCC TCG CTG GAT GCG CTC GCC CGT GCG GTG GGC CGG CGC GCG 
Pro Ala Ala Ser Leu Asp Ala Leu Ala Arg Ala Val Gly Arg Arg Ala 
1375 1380 1385 



24176 



GGC TGT GTG ACG GTC GCC GAC ATC GAC TGG GAG CGG TTC GCC TCC GCG 
Gly Cys Val Thr Val Ala Asp He Asp Trp Glu Arg Phe Ala Ser Ala 
1390 1395 1400 1405 



24224 



10 



TAC ACG GCC ACC CGT CCC ACG CCG ATG TTC GAC GAG GTG CCC GAG GTG 
Tyr Thr Ala Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val 
1410 1415 1420 



24272 



75 



CGG CGG ATA CAG GCC GCG TGG GCG GAA GCG GAG GCC GAC GCC GCG CGC 24320 

Arg Arg He Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg 
1425 1430 1435 

AGC GGT GCC GGC GGC GAC TCG CAG CTG CTG CGC TCC CTC CGG GGC CGG 24368 

Ser Gly Ala Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Arg 

1440 1445 1450 



20 



CCC GAG GAG GCC CAA CTG GCG GAG CTG CTG CGG CTG GTG CGC ACC CAT 
Pro Glu Glu Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His 
1455 1460 1465 



24416 



25 



GCC GCC GCG GTG CTC GGC CTG GGC TCG CCC GGC GCG GTG GAG GCG CGG 
Ala Ala Ala Val Leu Gly Leu Gly Ser Pro Gly Ala Val Glu Ala Arg 
1470 1475 1480 1485 



24464 



30 



CGT TCG TTC AAG GAC CTG GGC TTC AAC TCG GTG ACG GCG GTG GAG CTG 24512 
Arg Ser Phe Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu 
1490 1495 1500 

CGG AAC CGG CTG AAG GAG GCG ACG GGA CTC CGG CTG GAG GTG TCC CTG 24560 
Arg Asn Arg Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu 
1505 1510 1515 



35 



GTC TTC GAC CAC CCG GAC CCG GCC TCC CTC GCC CGG CAT CTG CTG GAT 
Val Phe Asp His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp 
1520 1525 1530 



24608 



40 



CTC GCC CTC GGC CAG GAG CCG GAG GAG ACG CCG CGG GCG TTC GCG CTC 
Leu Ala Leu Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu 
1535 1540 1545 



24656 



45 



GAA CCC GCG CCG AAC GGG GAG CCG ATC GCG ATC GTG TCC ATG GCC TGC 24704 
Glu Pro Ala Pro Asn Gly Glu Pro He Ala He Val Ser Met Ala Cys 
1550 1555 1560 1565 

CGT ATG CCG GGG GGT GTC AGC ACG CCC GAG GAG CTG TGG CGG CTG CTG 24752 
Arg Met Pro Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Leu 
1570 1575 1580 



50 



CGG GAC GGC AAG GAC GCG ATC GGG CCG TTC CCC GCC AAC CGG GGC TGG 
Arg Asp Gly Lys Asp Ala He Gly Pro Phe Pro Ala Asn Arg Gly Trp 
1585 1590 1595 



24800 



55 



GAC CTG GAG AAC CTC TAC GAC CCC GAC CCG GAC GCC GAC GGC CGC ACC 
Asp Leu Glu Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly Arg Thr 
1600 1605 1610 



24848 
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TAT GTG CGC GAG GGC GGA TTC CTC CAC GAG GCA CCG GAC TTC GAC CCC 24896 
Tyr Val Arg Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro 
1615 1620 1625 

TCG TTC TTC GGC ATC TCG CCG CGC GAG GCG CTG GCG ATG GAC CCG CAG 24944 
Ser Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin 
1630 1635 1640 1645 



CAG CGG CTG CTG CTG GAG ACT TCC TGG GAG GCC TTG GAG CGC GCC GGC 
10 Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly 

1650 1655 1660 



15 



20 TTC GAC GGC TAT CTG GGC ACC GGC AAC TCG GCG AGT GTG ATG TCG GGC 

Phe Asp Gly Tyr Leu Gly Thr Gly Asn Ser Ala Ser Val Met Ser Gly 
1695 1700 1705 
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24992 



ATC GAC CCG GCG AGG CTG CGC GGC AGC CGT ACC GGC GTC TTC GTC GGG 25040 
He Asp Pro Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly 
1665 1670 1675 

ACG AAC GGC CAG CAC TAC ATG CCG CTG CTC CAG AAC GGC GGG GAC AGC 25088 
Thr Asn Gly Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser 
1680 1685 1690 



25136 



25184 



55 



CGG CTG TCG TAC GTG TTC GGC CTC GAA GGC CCC GCC GTG ACC GTG GAC 
Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp 
1710 1715 1720 1725 

ACC GCC TGC TCG GCG TCC CTG GTG GCG CTG CAC CTG GCG GTG CAG GCG 25232 
Thr Ala Cys Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala 
1730 1735 1740 

ATG CGG CGC GGC GAG TGC GAC ATG GCG CTG GTC GGC GGC GCG ACG GTG 25280 
Met Arg Arg Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val 
1745 1750 1755 

ATG TCG ACG CCC GAG ATG CTG GTG GAG TTC TCC CGG CAG CGG GTG ATC 25328 
Met Ser Thr Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val He 
1760 1765 1770 

TCC GCC AAC GGC CGG TCG AGG GCC TTC GCC GCC GGT GCC GAC GGT GTG 2537 6 

Ser Ala Asn Gly Arg Ser Arg Ala Phe Ala Ala Gly Ala Asp Gly Val 
1775 1780 1785 

GCG CTC GGC GAG GGC GTG GGC GTC CTG CTG GTG GAG CGG CTG TCG GAC 25424 
Ala Leu Gly Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp 
17 90 1795 1800 ~ 1805 

GCC GAG CGC AAC GGC CAT CCG GTG CTG GCG GTG GTC CGC GGC TCG GCG 25472 
Ala Glu Arg Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala 
1310 1815 1820 

GTC AAC CAG GAC GGC GCC TCC AAC GGG CTG ACG GCG CCC AAC GGG CCC 25520 
Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
1825 1830 1835 

TCC CAG CAG CGG GTG ATC CGG CAG GCG CTG GCG GAC GCC GGG CTG CGG 25568 
Ser Gin Gin Arg Val He Arg Gin Ala Leu Ala Asp Ala Gly Leu Arg 
1840 1845 1850 



42 
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CCC GAG GAG ATC GAC GCC GTC GAG GCG CAC GGC ACC GGC ACC GAG CTG 
Pro Glu Asp lie Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu 
1855 1860 1865 



25616 



GGC GAC CCC ATC GAG GCC GAG GCG CTG CTC GCC ACC TAT GGA AGG ACC 
Gly Asp Pro lie Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr 
1870 1875 1880 " 1885 



25664 



10 



CGT ACG GCG GAC CGC CCG CTG TGG CTC GGC TCC CTG AAG TCC AAC ATC 
Arg Thr Ala Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn lie 
1890 1895 1900 



25712 



15 



GGG CAC ACC CAG GCC GCC GCC GGC GTG GCG GGC GTC ATC AAG ATG GTG 25760 

Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val 
1905 1910 1915 

CTC GCG CTG GGC AAC GAG AC A CTG CCG CGC ACC CTG CAC GTG GAT GAG 25808 

Leu Ala Leu Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu 

1920 1925 1930 



20 



CCC ACA CCG CGC GTG GAC TGG TCC TCT GGC GCG GTC TCC CTG CTC ACC 
Pro Thr Pro Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr 
1935 1940 1945 



25856 



25 



GAG CCG GTG GAC TGG CCC GCC GGC CCG TCC GCG CCG CGC CGT GCG GCC 
Glu Pro Val Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala 
1950 1955 1960 1965 



25904 



30 



GTG TCC TCG TTC GGC ATC AGC GGC ACC AAC GCC CAC ACG ATC CTG GAG 25952 
Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu 
1970 1975 1980 

CAG GCC CCC GTC CCC GCG GAG TCC CGC CCC GGG ACG GAG CCG GCG GAC 26000 
Gin Ala Pro Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp 
1985 1990 1995 



35 



GGC ACG GGC GCG TGG GAG AAC GTG ACC GTT CCG CTG CTG CTG TCC GGC 
Gly Thr Gly Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly 
2000 2005 2010 



26048 



40 



CAC ACC GAG GCG GCG CTG CGC GAG CAG AGC ACG AGG CTG CTG AAC GAC 
His Thr Glu Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp 
2015 2020 2025 



26096 



45 



50 



CTG CTG GAG CAC CCG GAC GAG CAC CCG GCC GAC GTC GGC TAC ACC CTG 26144 
Leu Leu Glu His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu 
2030 2035 2040 " ^ 2045 

ATC ACC GGC AGG GCC CAC TTC GGG CAC CGG GCC GCC GTG ATC GGC GAG 26192 
He Thr Gly Arg Ala His Phe Gly His Arg Ala Ala Val He Gly Glu 
2050 2055 2060 

AGC CGG GAA GAA CTG CTC GAC GCC CTG AAG GCT CTG GCC GAG GGC CGC 26240 
Ser Arg Glu Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg 
2065 2070 2075 



55 



GAG CAC CAC ACC GTG GTA CGG GGC GAC GGG ACG GCC CAC CCG GAC CGG 
Glu His His Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Arg 
2080 2085 2090 



26288 
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CGC GTG GTC TTC GTC TTC CCC GGG CAG GGC TCG CAG TGG CCG TCG ATG 26336 
Arg Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met 
2095 2100 2105 

5 GCC CGG GAC CTG CTC GAC CGC GCG CCC GCC TTC CGC GAG ACG GCG AAG 26384 

Ala Arg Asp Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys 
2110 2115 2120 2125 

GCC TGC GAC GCC GCG CTG AGC GTC CAT CTG GAC TGG TCC GTG CTC GAT 26432 
10 Ala Cys Asp Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp 

2130 2135 2140 



15 



25 



30 



GTC CTC CAG GAG AAG CCG GAC GCG CCG CCG CTG AGC CGG GTC GAC GTG 26480 
Val Leu Gin Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val 
2145 2150 2155 

GTG CAG CCC GTG CTG TTC ACG ATG ATG CTG TCG CTC GCC GCC TGC TGG 26528 
Val Gin Pro Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp 
2160 2165 2170 

2Q CGG GAC CTC GGC GTC CAC CCG GCC GCC GTG GTG GGC CAC TCC CAG GGA 26576 

Arg Asp Leu Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly 
2175 2180 2185 

GAG ATC GCG GCG GCC TGC GTG GCC GGC GCG CTC TCC CTG GAG GAC GCG 26624 
Glu He Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala 
2190 2195 2200 2205 

GCG CGG ATC GTG GCG CTG CGC AGC CGG GCA TGG CTC ACA CTG GCC GGC 26672 
Ala Arg He Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly 
2210 2215 2220 

AAG GGC GGC ATG GCC GCC GTC TCC CTG CCG GAA GCC CGG CTG CGC GAG 26720 
Lys Gly Gly Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu 
2225 2230 2235 

CGG ATC GAG CGG TTC GGG CAG CGG CTG TCG GTG GCC GCG GTC AAC AGC 26768 
Arg He Glu Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser 
2240 2245 2250 

CCG GGC ACG GCG GCG GTC GCC GGT GAC GTG GAC GCG CTG CGG GAA CTG 26816 
Pro Gly Thr Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu 
2255 2260 2265 

CTG GCG GAG CTG ACC GCG GAG GGC ATC CGG GCC AAG CCG ATC CCC GGC 26864 
Leu Ala Glu Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly 
22 ™ 2275 2280 2285 

GTG GAC ACG GCC GGC CAC TCC GCG CAG GTG GAC GGC CTG AAG GAG CAT 26912 
Val Asp Thr Ala Gly His Ser Ala Gin Val Asp Gly Leu Lys Glu His 
2290 2295 * * 2300 

CTC TTC GAG GTG CTG GCG CCG GTC TCC CCG CGC TCC TCG GAC ATC CCG 26960 
Leu Phe Glu Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro 
2305 2310 2315 

TTC TAC TCG ACG GTG ACG GGC GCG CCG CTG GAC ACC GAG CGG CTG GAC 27008 
Phe Tyr Ser Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp 
55 2320 2325 2330 



35 



40 



45 



50 
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GCC GGG TAC TGG TAC CGC AAC ATG CGG GAG CCC GTG GAG TTC GAG AAG 
Ala Gly Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe Glu Lys 
2335 2340 2345 



27056 



GCC GTC AGG GCA CTG ATC GCC GAC GGC TAC GAC CTG TTC CTG GAG TGC 
Ala Val Arg Ala Leu lie Ala Asp Gly Tyr Asp Leu Phe Leu Glu Cys 
2350 2355 2360 2365 



27104 



10 



AAC CCG CAC CCG ATG CTC GCC ATG TCG CTG GAC GAG ACA CTC ACC GAC 
Asn Pro His Pro Met Leu Ala Met Ser Leu Asp Glu Thr Leu Thr Asp 
2370 2375 2380 



27152 



15 



AGC GGC GGC CAC GGC ACC GTG ATG CAC ACC CTC CGC CGG CAG AAG GGC 27200 

Ser Gly Gly His Gly Thr Val Met His Thr Leu Arg Arg Gin Lys Gly 
2385 2390 2395 

AGC GCC AAG GAC TTC GGC ATG GCG CTC TGC CTC GCC TAT GTC AAC GGA 27248 

Ser Ala Lys Asp Phe Gly Met Ala Leu Cys Leu Ala Tyr Val Asn Gly 

2400 2405 2410 



20 



CTG GAG ATC GAC GGA GAA GCC CTC TTC GGC CCC GAC TCA CGC CGG GTG 
Leu Glu lie Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val 
2415 2420 2425 



27296 



25 



30 



AAC CCG CCG ACG TAC CCG TTC CAG CGG GAG CGC TAC TGG TAC CAC CCC 
Asn Pro Pro Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp Tyr His Pro 
2430 2435 2440 2445 



27344 



ACG AGC GGC AGG CGC GGC GAC ATC ACG GCG GCC GGC GTG GCC GAG GCG 27392 
Thr Ser Gly Arg Arg Gly Asp He Thr Ala Ala Gly Val Ala Glu Ala 
2450 2455 2460 

GAG CAC CCG CTG CTC GGC GCC GGC GTC GAA CTC CCG GAG ACC GGG GGC 27440 
Glu His Pro Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly 
2465 2470 2475 



35 



ACG GTG TAC ACC GCG CGG TTC GGC CCG GAC AGC CGG CCG TGG CTG GCC 
Thr Val Tyr Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Ala 
2480 2485 2490 



27488 



40 



GAC CAC GCG CTG CTG GGC ACC GTG CTG CTG CCC GGC ACG GCA ATC CTG 
Asp His Ala Leu Leu Gly Thr Val Leu Leu Pro Gly Thr Ala He Leu 
2495 2500 2505 



27536 



45 



50 



GAC CTG GTG CTG TGG GCG GGC GAA CGC TCC GGC TGC GGC CGC GTC GGT 27584 
Asp Leu Val Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val Gly 
2510 2515 2520 2525 

GAA CTG GCG CTC CAG GCA CCG CTG GTC CTG CCG GAC AGC GGC GAC GTC 27632 
Glu Leu Ala Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val 
2530 2535 2540 

GAA CTG CGG CTG CTG GTC GGC GGC CCG GAC GAG GAG AAA CGG CGC ACC 27680 
Glu Leu Arg Leu Leu Val Gly Gly Pro Asp Glu Glu Lys Arg Arg Thr 
2545 2550 2555 



55 



GTC ACC GTG CAC GCG CGG CCC GCG GCG GCC GGC GCC GAG GCG CCG TGG 
Val Thr Val His Ala Arg Pro Ala Ala Ala Gly Ala Glu Ala Pro Trp 
2560 2565 * 2570 
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ACC CGG CAC GCC GAA GCC GTG GTG CTG CCC GCC ACC GGC GAG GAG CCG 27776 
Thr Arg His Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro 
2575 2580 2585 

5 ACC CCC GCC CCG CGC CCC GTC CCC GAG CCG GCG GGC ACC ACG GAC CCC 27824 

Thr Pro Ala Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro 
2590 2595 2600 2605 

GCC GCG TTC TAC GCG GAG TTC GCC GAG CGC GGC TAC GAC TAC GGC CCG 27872 
10 Ala Ala Phe Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro 

2610 2615 2620 



15 



GCC TTC CAG GGC TTC ACC GCC GGA GCG CGC CAC GGC GAG GAC GTC GTC 27920 
Ala Phe Gin Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val 
2625 2630 2635 

GCC GAG GTG GCG CTG CCC AGC GGC CTG GTG GCG GAC GCC CGT CAC CAC 27968 
Ala Glu Val Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His 
2640 2645 2650 



20 CGG CTG CAC CCG GCG CTG CTC GAC GCC GCG CTC CAG GCG ATG ATC CTC 

Arg Leu His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met He Leu 
2655 2660 2665 



28016 



25 



30 



35 



40 



45 



50 



55 



GGC ACG TTC TTT GCC GAC GAC GGC CGC GCC CGG ATG CCG TTC GCG GTG 28064 
Gly Thr Phe Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val 
2670 2675 2680 2685 

CGC GGA GTA CGG CTG CAC ACG GCC GGC GCC GAC CGG CTG CGC GTC CTG 28112 
Arg Gly Val Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu 
2690 2695 2700 

ATC TCC CCG GCG GGC GAC GAG ACC GTA CGG CTG CTC TGC ACC GAC CTC 28160 
He Ser Pro Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu 
2705 2710 2715 

GCG ACC GGC GCC CCC GTG CTG GAG ATC GAC GAA CTG GTC GTC CGC CCG 28208 
Ala Thr Gly Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro 
2720 2725 2730 

GTG TCC GGC GAG CAG TTG GCG GCC GGC GCC CCG GGC CGC AAC GGC GGC 28256 
Val Ser Gly Glu Gin Leu Ala Ala Gly Ala Pro Gly Arg Asn Gly Gly 
2735 2740 2745 

GAG CTG TAC CGG GTC GAC TGG ACG GTG CTG CCG GAG CCC GCC GAG GTG 28304 
Glu Leu Tyr Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val 
2750 2755 2760 2765 

CCC GCG CCG CGC TGG GCC CTC CTC GGC GAG GAC CAC GCC GGC CTG GCC 28352 
Pro Ala Pro Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala 
2770 2775 2780 

GAT GTG CTC GGA GGG ACG GGC GGC GGC TGC GAG CGG TAC GAC ACC CTC 28400 
Asp Val Leu Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu 
2785 2790 2795 

ACC GGC CTG CTG GAG GCC ACC ACC CGG TCG GCC GGC GGA ATC CTG CCC 28448 
Thr Gly Leu Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly He Leu Pro 
2800 2805 2810 
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GAC ATC GTC GCG CTC TCC TTG CCC ACC GCC CCG GAG CCC GGC CCC CAG 
Asp lie Val Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin 
2815 2820 2825 



28496 



GCG GTG CGC GAG GTG CTG TCC CAG GCG CTC GAC GCC GCC CAG GCG TGG 
Ala Val Arg Glu Val Leu Ser Gin Ala Leu Asp Ala Ala Gin Ala Trp 
2830 2835 2840 2845 



28544 



10 



CTG GCC GCC GGC GCC GAG ACC GCC TCC GCC CGG CTG GTG TTC GTC ACC 
Leu Ala Ala Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr 
2850 2855 2860 



28592 



75 



GGC GGC GCG GTG GCC ACC ACG GCG GAC GAA ACC GTG CGC GAC ATC GCG 28640 
Gly Gly Ala Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala 
2865 2870 2875 

GCG GCC GCC GTC TGG GGC CTG GTC CGC TCG GCG CAG TCC GAG GAA CCC 28688 
Ala Ala Ala Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro 
2880 2885 2890 



20 



GAC CGC ATG GTC CTG CTC GAC CTG GAC GGC GAG CGG CCC ACC GCG CGG 
Asp Arg Met Val Leu Leu Asp Leu Asp Gly Glu Arg Pro Thr Ala Arg 
2895 2900 2905 



28736 



25 



ACG CTG GCG GCG GCG CTC GCG TCC GGC GAA CCG CAA CTC GCC GTG CGC 
Thr Leu Ala Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg 
2910 2915 2920 2925 



28784 



30 



GGC TCC ACG GTG GCC GCT CCC CGG CTG GCC CCG GCC GGG CCC GGC CCG 28832 
Gly Ser Thr Val Ala Ala Pro Arg Leu Ala Pro Ala Gly Pro Gly Pro 
2930 2935 2940 

GAG GAC CTC GTA CCG CCC GCC GGC ACC ACC GCC TGG CGG CTC ACC CCC 28880 
Glu Asp Leu Val Pro Pro Ala Gly Thr Thr Ala Trp Arg Leu Thr Pro 
2945 2950 2955 



35 



GGC GGG GGG ACG CTG GAG GAA CTG TCG CTC GCG CCC GCC CCC GAC GCG 
Gly Gly Gly Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala 
2960 2965 2970 



28928 



40 



GAG GAA CCA CTG GCA CCG GGC CAG GTA CGC ATC GCC GTC CGC GCG GCG 
Glu Glu Pro Leu Ala Pro Gly Gin Val Arg He Ala Val Arg Ala Ala 
2975 2980 2985 



28976 



45 



GGC GTG AAC TTC CGC GAC GCC CTG ATC GCC CTC GGC ATG TAC CCG GGC 29024 
Gly Val Asn Phe Arg Asp Ala Leu He Ala Leu Gly Met Tyr Pro Gly 
2990 2995 3000 3005 

AAG GGA ACC ATG GGC GCC GAG GGA GCC GGC GTC GTC GTC GAG ACC GCC 29072 
Lys Gly Thr Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala 
3010 3015 3020 



50 



CCC GAT GTC ACC GGC CTC TCC GCC GGA GAC CGC GTG CTC GGC ATG TGG 
Pro Asp Val Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp 
3025 3030 3035 



29120 



55 



AAC GGC GGC TTC GGG CCC CTC GTG GTG GCC GAC CAC CGC ATG GTG GCC 
Asn Gly Gly Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala 
3040 3045 3050 



29168 
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CCG ATC CCC CAC GGC TGG TCG TAC GCC GAG GCG GCC TCC GTG CCC GCC 
Pro He Pro His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala 
3055 3060 3065 

GTG CTC CTC ACC TCC TAC TAC GCG CTG ACC CGG CTG GCC CGG GCC CGC 
Val Leu Leu Thr Ser Tyr Tyr Ala Leu Thr Arg Leu Ala Arg Ala Arg 
3070 3075 3080 3085 

ACC GGA CAG ACC GTC CTC GTC CAC GCC GCC GCC GGC GGT GTC GGC ATC 
Thr Gly Gin Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met 
3090 3095 3100 

GCG ACC CTC CAA CTC GCC CGC CAC CTC GGC CTG GAG GTG TAC GCC ACC 
Ala Thr Leu Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr 
3105 3110 3115 

GCG AGC ACC GGC AAA TGG GAC GCC CTG CAG AAG CAC GGC ATC CCC GAC 
Ala Ser Thr Gly Lys Trp Asp Ala Leu Gin Lys His Gly He Pro Asp 
3120 3125 3130 

GAC CGC ATC GCC GAC TCC CGC ACC CTG GAC TTC GCC GAG CGC TTC CTC 
Asp Arg He Ala Asp Ser Arg Thr Leu Asp Phe Ala Glu Arg Phe Leu 
3135 3140 " 3145 

TCC CGG ACG GGC GGC CGG GGT GTC GAC ATC GTG CTG AAC TCC CTG GCC 
Ser Arg Thr Gly Gly Arg Gly Val Asp He Val Leu Asn Ser Leu Ala 
3150 3155 3160 3165 

GGC GAG TTC GTC GAC GCC TCA CTG CGG CTG CTG CCG CGC GGC GGG CAC 
Gly Glu Phe Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His 
3170 3175 ~ 3180 

TTC CTG GAA CTC GGC AAG GCC GAC GTC CGC GAC CCC CGG CGG ATC GCC 
Phe Leu Glu Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg He Ala 
3185 3190 3195 

GCC GCC CAT CCG GGC ACC GAC TAC CGG GCG TTC GAC CTG GTG CAG GCC 
Ala Ala His Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala 
3200 3205 3210 

GGT CCC GAC ACC GTC GGG GAG ATG CTC GGG GAA CTC CTG GAA CTG TTC 
Gly Pro Asp Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe 
3215 3220 3225 

GCG GCC GGA GCG CTG CGC CCG CTG CCG CTC ACC GCC TAC GGC ATA CGC 
Ala Ala Gly Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly He Ara 
3230 3235 3240 3245 

GAC GCC CGC ACC GCC TTC CGC ACC CTC AGC CAG GCC CGG CAC ACC GGC 
Asp Ala Arg Thr Ala Leu Arg Thr Leu Ser Gin Ala Arg His Thr Gly 
325 <> 3255 ~ 3260 

AAG CTC GTG CTG ACG GTG CCT GCC GGA TTC GAC ACC CAC CGC ACG GTG 
Lys Leu Val Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val 
3265 3270 3275 

CTC CTC ACC GGC GGC ACG GGC ACG CTC GGC CAG ACA CTC GCC CGC CAT 
Leu Leu Thr Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Arg His 
3280 3285 3290 



29216 
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CTG GTC AAC CGC CAC GGC GTA CGG CAC CTG CTG CTC GCC GGC CGC ACC 
Leu Val Asn Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr 
3295 3300 3305 



29936 



GGC GCG GCG GCC GAG GGC GTC GCG GAA CTG ATT GGT GAA CTG GGC GAG 
Gly Ala Ala Ala Glu Gly Val Ala Glu Leu lie Gly Glu Leu Gly Glu 
3310 3315 3320 3325 



29984 



10 



TTG GGC GCC GAG GTC CGG GTC GCG GCC TGC GAC GCG GCC GAC CGG CAG 
Leu Gly Ala Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin 
3330 3335 3340 



30032 



15 



CGG CTC ACC GAA CTC CTC GCC GGA ATC CCC GTC GAG CAC CCG CTC GGC 30080 

Arg Leu Thr Glu Leu Leu Ala Gly He Pro Val Glu His Pro Leu Gly 
3345 3350 3355 

GCC GTC GTC CAC GCC GCG GGC ACC CTC GAC GAC GGC ACC ATC CCC TCA 30128 

Ala Val Val His Ala Ala Gly Thr Leu Asp Asp Gly Thr He Pro Ser 

3360 3365 3370 



20 



CTG ACC GGC GAG AAC ATC GAC AAC GTG CTG CGG CCC AAG GCC GAC GCC 
Leu Thr Gly Glu Asn He Asp Asn Val Leu Arg Pro Lys Ala Asp Ala 
3375 3380 3385 



30176 



25 



GTG CTC AAC CTG CAC GAG CTG ACC CGC GAC GCC GAC CTC TCG GCG TTC 
Val Leu Asn Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Ala Phe 
3390 3395 3400 3405 



30224 



30 



GTC CTC TAC TCG TCC TCC TCG GCG CTG CTC GGC AGC CCC GGC CAG GGC 30272 
Val Leu Tyr Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro Gly Gin Gly 
3410 3415 3420 

GCC TAC GCC GCG GCC AAC GCC TTC CTG GAC GGC TTC GCC CGA TAC CGC 30320 
Ala Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg 
3425 3430 3435 



35 



AAG GGC CTC GGG CTG CCG GCG CTC TCG CTG GCC TGG GGA CTG TGG GGC 
Lys Gly Leu Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Leu Trp Gly 
3440 3445 3450 



30368 



40 



AGC AAC AGC CGC ATG GCG GGC CAC CTC GAC CAG TCG GGC ATG CAA CGG 
Ser Asn Ser Arg Met Ala Gly His Leu Asp Gin Ser Gly Met Gin Arg 
3455 3460 3465 



30416 



45 



CGC CTG AAC CGG AGC GGC ATC ATG GCG CTC ACC GAC GCC GAG GGC CTC 30464 
Arg Leu Asn Arg Ser Gly He Met Ala Leu Thr Asp Ala Glu Gly Leu 
3470 3475 3480 3485 

GCC CTG TTC GAC GCC GCA CAG GAC GGC GGG GAC GCG CTG CTG GTG CCG 30512 
Ala Leu Phe Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro 
3490 3495 3500 



ATG CGG CTC AAC CGG ACG GCC CTT CGC GCC TCG GGA CGG ATC ACC CCG 
50 Met Arg Leu Asn Arg Thr Ala Leu Arg Ala Ser Gly Arg He Thr Pro 

3505 3510 3515 



30560 



55 



TTC CTC AGC GGC TTG GCC GGC GGC GGG CCG GCG GCG GGG GAG AGG CGC 
Phe Leu Ser Gly Leu Ala Gly Gly Gly Pro Ala Ala Gly Glu Arg Arg 
3520 3525 3530 



30608 
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CCC GAG GTG GCA GCC GTA TCC GGG ACA CTC GCG GAA CGG CTG ACC GGG 
Pro Glu Val Ala Ala Val Ser Gly Thr Leu Ala Glu Arg Leu Thr Gly 
3535 3540 3545 

CTC ACG GCA CAG GAA GGG CAC GCC CTC GTC CTG GCC GAG ATC CGC GCC 
Leu Thr Ala Gin Glu Gly His Ala Leu Val Leu Ala Glu He Arg Ala 
3550 3555 3560 3565 

CAC GCG GCG GCG GTG CTG GGC CAC GGC TCC GAC GAC TCG ATC CCC GAG 
His Ala Ala Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu 
3570 3575 3580 

GAC CGG GCC TTC AAG GAC CTC GGC TTC GAC TCG CTC ACC GCC GTG GAG 
Asp Arg Ala Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
3585 3590 3595 

ATG CGC AAC CGG CTG AGC GCG GCC ACC GGC CTC CGG CTG CCC GCC ACC 
Met Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr 
3600 3605 3610 

CTC GTC TTC GAC CAC CCG ACC CCG GGC GAG CTG GCC GGC CAC CTG AGT 
Leu Val Phe Asp His Pro Thr Pro Gly Glu Leu Ala Gly His Leu Ser 
3615 3620 3625 

GCT GAA CTG TCC GCC GAC GAT GCC CCG GGC AGC GCC TCC CCG CTT ACC 
Ala Glu Leu Ser Ala Asp Asp Ala Pro Gly Ser Ala Ser Pro Leu Thr 
363 0 3635 3640 3645 

GAA CTC GAC CGT TTC GAA GCC CTG TTC ACC GCT CTC GCA CCG GGG ACC 
Glu Leu Asp Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro Gly Thr 
3550 3655 3660 

ACC AAG GAC ACC CCG GGC GGG GCC GGG GCA CTG ATG ATC GAC GAG GCC 
Thr Lys Asp Thr Pro Gly Gly Ala Gly Ala Leu Met He Asp Glu Ala 
3665 3670 3675 

GAG CGC CAA GAG ATC GCC GGG CGG CTC GCG GCG CTG GCC GGT CTG TCG 
Glu Arg Gin Glu He Ala Gly Arg Leu Ala Ala Leu Ala Gly Leu Trp 
3680 3685 3690 

AAC CGG CTG CAC GGC ACC ACG ACG GCT CCT GAG GAC GGC GAC ACC GTC 
Asn Arg Leu His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp Thr Val 
3695 3700 3705 

GCG GAC GCC CTG GAA GCC GCG GAC GAC CAC GAG ATC TTC GCA TTC CTC 
Ala Asp Ala Leu Glu Ala Ala Asp Asp His Glu He Phe Ala Phe Leu 
37 1° 3715 3720 3725 

GAC GAG CGG TTC TGA GCCCGCCCCA GCGACAGCAC AGGTGAAAAC AC ATG GCC 
Asp Glu Arg Phe * Met A1 

3730 l 

AAC GCG AAC GAG CAG CAA CTC CGT GCC TAT CTG AAG CGA GCG ACG ACC 
Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Ala Thr Thr 
5 10 is 

GAA CTC CAC CGT ACC TCC GAA CAA CTG AGG GAG GAG CGG GCA CGG GCC 
Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ala Arg Ala 
20 25 30 
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CAC GAG CCG ATC GCC GTC GTC GGC ATG GCC TGC CGC TAC CCC GGA GGC 31381 
His Glu Pro lie Ala Val Val Gly Met Ala Cys Arg Tyr Pro Gly Gly 
35 40 45 * 50 

GCG AAC ACC CCC GAA CAG TTC TGG GAA CTG CTC GAC ACC GGC ACC GAC 31429 
Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu Leu Asp Thr Gly Thr Asp 
55 60 65 

GCC GCC GCG CCG ATG CCC TCC GAC CGG GGA TGG GAC ACC CAC GGG CTG 31477 
Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His Gly Leu 
70 75 80 

TAC GAC CCC GAC CCG GCG GCA GCG GGG CGC ACC TAC TGC CGG GAG GGC 31525 
Tyr Asp Pro Asp Pro Ala Ala Ala Gly Arg Thr Tyr Cys Arg Glu Gly 
85 90 95 

GGC TTC CTC CAC GAC GCG GGC GAC TTC GAC GCG GAC TTC TTC GGG ATT 31573 
Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Asp Phe Phe Gly lie 
100 105 110 

TCG CCG CGT GAG GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG 31621 
Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 130 



GAG ACC TCC TGG GAG GCG ATC GAA GCG GCC GGT ATC GAC CCG CGA GGA 31669 
Glu Thr Ser Trp Glu Ala He Glu Ala Ala Gly He Asp Pro Arg Gly 
25 135 140 145 

CTC CGC GGC AGC CGC ACC GGG GTG TAC GTG GGC GCC TGG GAC AGC GGC 31717 
Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala Trp Asp Ser Gly 
150 155 160 



55 



TAC ACC GGC CAG GCG CAC GCG CCC TCG GCC GAG TTG GAG GCC GAC CTG 31765 
Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu Leu Glu Ala Asp Leu 
165 170 175 

CTG ACC GGC GGC GTC GTC AGC TTC ACC TCC GGC CGT ATC GCC TAC ACG 31813 
Leu Thr Gly Gly Val Val Ser Phe Thr Ser Gly Arg He Ala Tyr Thr 
180 185 190 

CTG GGG CTG GAG GGT CCG GCC TTG ACC GTG GAC ACC GCG TGT TCG TCG 31861 
Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 * 210 

TCG CTG GTC GCC CTG CAC AAC GCG GCG CAG GCG CTG CGG CGC GGC GAA 31909 
Ser Leu Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Arg Gly Glu 
215 220 225 

TGC GAC CTG GCG TTG GCC GGT GGT GTG ACG GTG ATG GCG ACC CCG GCG 31957 
Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala 
230 235 240 

GTG TTC GTC CAG TTC GCC CGG CAG CGA GGG CTG GCG CCG GAC GGC CGC 32005 
Val Phe Val Gin Phe Ala Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

TGC AAG GCG TTC GCC GAC GCC GCC GAC GGC TTC GGC CCC GCC GAG GGT 32053 
Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Phe Gly Pro Ala Glu Gly 
260 265 270 
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GTG GGG ATG GTG CTG GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG 32101 

Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly 
275 280 285 ~ 290 

5 CAT CCG GTG TTG GCG GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT 32149 

His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly 
295 300 305 

GCG TCG AAT GGT TTG ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG 32197 
1Q Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val 

310 315 320 

ATT CGT CAG GCG TTG GGG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT 32245 

He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp 
325 330 335 

15 

GTG GTG GAG GCG CAT GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG 32293 

Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu 
340 345 350 

20 GCG CAG GCG TTG CTG GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT 32341 

Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg 
355 3 60 365 370 

CCG GTG TGG TTG GGG TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG 32389 
Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
25 375 380 385 

GCT GCG GGG GTG GCT GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT 32437 
Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg Tyr 
390 395 400 
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GGG TGG TTG CCG CGG ACG TTG CAT GTG GAT GAG CCG AGC CGG CAT GTG 32485 
Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val 
405 410 415 

GAC TGG TCG GCT GGT GGT GTG CGG TTG CTG ACC GAG GCG CGG GAG TGG 32533 
Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp 
420 425 430 



CCG GGG GTG GAC CGG CCG CGT CGG GCG GCG GTC TCC GCC TTT GGT GTC 32581 
Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val 
40 4 3 5 4 4 0 4 4 5 4 5 0 

AGT GGT ACC AAC GCC CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG 32629 
Ser Gly Thr Asn Ala His Leu He Leu Glu Ala Pro Asp Thr Ala Glu 
455 460 465 

GCG GAG AGC GCC ACG ACC CCG GTC CGC TCT GAG GTG TCG GAG TCT GCT 32677 
Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala 
470 475 480 

GCG GTC CTC GAT GCC CGC AGT GGT GTG GTG CCG GTG GTG GTT TCG GGG 32725 
Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly 
485 490 495 

CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG 32773 
Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val 
500 505 510 
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GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG 32821 
Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala 
515 520 525 530 

GGC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG 32869 
Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu 
535 540 545 

GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC 32917 
Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp 
550 555 560 

GCG GGT GTG GTC ACG GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT 32965 
Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly 
565 570 575 

GGT GGG GGG GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG 33013 
Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val 
580 585 590 

GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 33061 
Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
595 600 605 610 



ATG CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTG GGG TGG GAT TTG 33109 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
25 615 620 625 

CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG 33157 

Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
630 635 640 

30 

CAG CCG GTG ACG TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG 33205 

Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 
645 650 655 

GCG ATG GGT GTG GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG 33253 
Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu 
660 665 670 

ATC GCT GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG 33 301 

He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
675 680 685 690 

GCT GTG GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT 33349 
Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly 
695 700 705 

CGT GGT GCG ATG GCG GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC 33397 
Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala 
710 715 720 

GGG CTG GCG AAG TGG CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG 33445 
50 Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro 

725 730 735 

GCG TCT ACG GTG GTT TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTG 33493 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val 
5S 740 745 750 
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GAA CGG CTG GTG GTC GAT CCG GTC GCC GTC GGA CGC GAC GGG CTC GCG 
Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly Leu Ala 
995 1000 1005 1010 



34261 



GCG CGC CTG GGC GAA CGG TGG GAC GGT GTG CTG TCC CTG CTC GGC GCC 
Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu Gly Ala 
1015 1020 1025 



34309 



10 



GAC GAG CGT CCG CTC CCA CGG CAT CCC GCC CTC AAC CGC GCC GTC ATG 
Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala Val Met 
1030 1035 1040 



34357 



75 



GGC ACC ACG CTG CTC GCC CAG GCC GCT CTG GAC GCA GGA TGC GAG GCG 34405 
Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys Glu Ala 
1045 1050 1055 

CGG ATA TGG GCC GTG ACG CGG GAG GCC GTC GCC GTC TCC CCG AGC GAG 34453 
Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro Ser Glu 
1060 1065 1070 



20 



GTG CCG CGG GAC GCC GGC GCG CAG CTC TGG GGG CTC GGG CGG GGC ATC 
Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg Gly He 
1075 1080 1085 1090 



34501 



25 



GCG CTG GAA CAC CCC TCC CTC TGG GGC GGA TTG ATC GAT CTG CCC GCC 
Ala Leu Glu His Pro Ser Leu Trp Gly Gly Leu He Asp Leu Pro Ala 
1095 1100 1105 



34549 



30 



GTG CCG GAC GAA CGC GCG TGG GCC AGG GCC GTC CGG CGG CTC GTC CCG 34597 
Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu Val Pro 
1110 1115 1120 

CAC GGT GAG GAC CAG ATC GCC GCG CGC GCC TCG GGT GCC TAT GGG CGC 34645 
His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr Gly Arg 
1125 1130 1135 



35 



AGG CTC CTG CCG GCT CCG CCG GCC GCG TCG CGC CGC ACC TGC ACA CCG 
Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys Thr Pro 
1140 1145 1150 



34693 



40 



TCC GGC ACG GTG CTG GTC ACC GGC GGT ACG GGA GCG CTC GGC GGT CAT 
Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His 
1155 1160 1165 1170 



34741 
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CTG GCC CGC CGT CTC GCA CGC GGC GGG ACC GGG CAT CTG GTG CTC ACC 34789 
Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val Leu Thr 
1175 1180 1185 

AGC CGT CGC GGC CCG GAC GCG CCG GGC GCC GGT GAA CTC GCC GGT GAA 34837 
Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala Gly Glu 
1190 1195 1200 



CTC GCC TCC CTG GGC GCG AAG GTC ACG GTC GCC GCG TGC GAC ATG GCC 
so Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp Met Ala 

1205 1210 1215 



34885 
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GAC CGT GAA GCC GTG CGG GCG CTG CTC GAC GAG CAC CGG CCG ACC GCG 
Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro Thr Ala 
1220 1225 1230 
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GTG TTC CAC ACG GCG GGC ACG CCC CAC TCG GCG GAG TTC ACG GCG CTG 
Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr Ala Leu 
1235 1240 1245 1250 

GAC GAG ACG ACG ACG GCC GGG GTG TAC GGC GGG AAG GTC CTG GGT GCC 
Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu Gly Ala 
1255 1260 1265 

CGG CAT CTG GAC GAA CTG ACC CGG GAA CTC GGC ATC GGG CTG GAC GCG 
Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly lie Gly Leu Asp Ala 
1270 1275 1280 

TTC GTC CTC TTC TCC TCC GGC GCC GCG GTC TGG GGC AGC GGC GGC CAG 
Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly Gly Gin 
1285 1290 1295 

ACC GCT TAC GGG GCC GCG AAC GCC GCG CTG GAC GCC CTC GCC GAG CGG 
Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Aro 
1300 1305 1310 

CGC CGT GCC GCC GGG CTG CCC GCG ACC TCC GTC GCC TGG GGC CTG TGG 
Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp 
"15 1320 1325 1330 

GGC GGC GGA GGC ATC GGG GAG GGG GAC GGT GAG GAG TTC CTC AGC CGG 
Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu Ser Arg 
1335 1340 1345 

CGC GGC CTC GGC GTG ATG CCG CCG GAG GAC GCG CTG GAA GCC CTG GAC 
Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala Leu Asp 
1350 1355 1360 

CGG GCC CTG GAC CGG GAG GAC ACC ACC GTC GTG GTG GCG GAT GTC GAC 
Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp Val Asp 
1365 1370 1375 

TGG GAG CGG TTC GCC CCG GCC TTC ACC GCG TTC CGG CCC AGT GCG CTG 
Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser Ala Leu 
1380 1385 1390 

ATC TCC CGG CTG GTC TCG GAC GGC GGG GAG GCG GGG GGG CAG GAC GCC 
lie Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin Asp Ala 
1395 1400 1405 1410 

CCG GAC GGC ACG CTG TTC GCC GCC GGG TTC GCG GCC GCC GGG CCA CTG 
Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly Pro Leu 
1415 1420 1425 

GAG CGG CAG GAG ATG CTG CTC GGC CTG GTG CGC CGG CAT GTG GCC GCC 
Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val Ala Ala 
1430 1435 " 1440 

GTA CTC GGC CAC CCG GGG ACC GCG GAC ATC GGT CCC GAC CGT GCT TTC 
Val Leu Gly His Pro Gly Thr Ala Asp He Gly Pro Asp Arg Ala Phe 
1445 1450 1455 

AAG GAG CTG GGG TTC AGT TCG GTC ACC GCC GTC GAG CTG GCC GGG CGG 
Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala Gly Arg 
1460 1465 1470 
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CTG GGC CGG GAG TGC GGA CGG AAG CTG CCG CCG ACG CTG GTC TTC GAC 
Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val Phe Asp 
1475 1480 1485 1490 



35701 



CAT CCG ACT GCC GCG GCG GCC GTC GAA CAC CTG GCG GAG CTG CTG ACA 
His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu Leu Thr 
1495 1500 1505 



35749 



10 



CCG CCC GCC GGT CCC GCC GCC GGT CCC CGG GAG GAG GAG GCG CGG GCC 
Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala Arg Ala 
1510 1515 1520 



35797 



75 



GCC CTG GCG CGC GTG CCG CTC GAA CGG CTG AGG GAA GCC GGC CTG CTG 35845 
Ala Leu Ala Arg Val Pro Leu Glu Arg Leu Arg Glu Ala Gly Leu Leu 
1525 1530 1535 

GAC GCA CTG CTG CGG CTC GCC GCG GAC GAA TCC GGG GCG ACA ACC CCC 35893 
Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr Thr Pro 
1540 1545 1550 



20 



25 



CGT ACG TCT GCC GCG TCC GGC GCA CCC CGC GGC CGG GAG GAG CCG GAC 35941 
Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu Pro Asp 
1555 1560 1565 1570 

GGC CGC GGC GAG CCG GAC GGC TCG GGA CAC CGC GAA AGC CCG GAC GCG 35989 
Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro Asp Ala 
1575 1580 1585 
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GCC GGC GGG TCG GAC GCC CTG GAC GAT CTC GAC GGG GAC GCC CTG GTG 3 6037 

Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala Leu Val 
1590 1595 1600 

CGG CTC GCC CTC GGG GAA CCG GGC GAG TGA CCGGCCGGCG GAGCACACCC 36087 
Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 1610 

GGCCGTCTCC GGCCCGGCCG CGGCCGGGCC GGAAGCCATC CGCCGCCCAC CCGGTACCGA 36147 

CCCCTCAAGC CCTTCAAGCC CTTCGACCCG TCCGATCAGT CAGTCCGGCG GTCCTCCACG 3 6207 

ACCGGTCCGG AATCGCCCCC ACACGAGTCA GGAAGCACAC C ATG GCC ATG TCC 3 6260 

Met Ala Met Ser 
1 

GCC GAG AGG CTG ACG GAG GCG CTG CGG ACC TCG CTC AAG GAG GCC GAG 36308 
Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu Lys Glu Ala Glu 
5 10 15 20 

CGG CTC CGG CGG CAG AAC CGC GAA CTG AGG GCC GCG CGG GAC GCG GCG 3 6356 

Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala Arg Asp Ala Ala 
25 30 35 

CGG GAG CCG ATC GCC GTC GTC GGC ATG GCC TGC CGC TAC CCG GGC GGT 36404 
Arg Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro Gly Gly 
40 45 ' " 50 

GTC ACC GGC CCC GAG GAG CTG TGG GAG CTG GTG GCC GGA GGC CGG GAC 36452 
Val Thr Gly Pro Glu Glu Leu Trp Glu Leu Val Ala Gly Gly Arg Asp 
55 60 65 
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GCG ATC GGG CCG TTC CCC GTG GAC CGG GGC TGG GAC GTG GCG TCG GTG 36500 
Ala lie Gly Pro Phe Pro Val Asp Arg Gly Trp Asp Val Ala Ser Val 
70 75 80 

S TAC GAC CCG GAT CCC GAG TCG AAG GGC ACC ACG TAC TGC CGG GAG GGC 36548 

Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr Cys Arg Glu Gly 
85 90 95 100 

GGG TTC CTG GAA GGC GCC GGT GAC TTC GAC GCC GCC TTC TTC GGC ATC 3 6596 

10 Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly He 

105 HO 115 

TCG CCG CGC GAG GCC CTG GTG ATG GAC CCG CAG CAG CGG CTG CTG CTG 3 6644 

Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu Leu 
120 125 130 

75 

GAG GTG TCC TGG GAG GCG CTG GAA CGC GCG GGC ATC GAC CCG TCC TCG 36692 
Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro Ser Ser 
135 140 145 

20 CTG CGC «3C AGC CGC GGT GGT GTC TAC GTG GGC GCC GCG CAC GGC TCG 36740 

Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala Ala His Gly Ser 
150 155 160 

TAC GCC TCC GAT CCC CGG CTG GTG CCC GAG GGC TCG GAG GGC TAT CTG 36788 
Tyr Ala Ser Asp Pro Arg Leu Val Pro Glu Gly Ser Glu Gly Tyr Leu 
25 "5 . 170 175 180 

CTG ACC GGC AGC GCC GAC GCG GTG ATG TCC GGC CGC ATC TCC TAC GCG 46836 
Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg He Ser Tyr Ala 
185 190 " 195 

CTC GGT CTC GAA GGA CCG TCC ATG ACG GTG GAG ACG GCC TGC TCC TCC 36884 
Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr Ala Cys Ser Ser 
200 205 210 

TCG CTG GTG GCG CTG CAT CTG GCG GTA CGG GCG CTG CGG CAC GGC GAG 3 6932 

Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu Arg His Gly Glu 
215 220 225 
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TGC GGG CTC GCG CTG GCG GGC GGG GTG GCG GTG ATG GCC GAT CCG GCG 3 6980 

Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met Ala Asp Pro Ala 
40 230 235 240 

GCG TTC GTG GAG TTC TCC CGG CAG AAG GGG CTG GCC GCC GAC GGC CGC 37028 
Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala Ala Asp Gly Arg 
245 250 255 260 

TGC AAG GCG TTC TCG GCC GCC GCC GAC GGC ACC GGC TGG GCC GAG GGC 37076 
Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly Trp Ala Glu Gly 
265 270 275 

GTC GGC GTG CTC GTC CTG GAG CGG CTG TCG GAC GCG CGC CGC GCG GGG 37124 
Val Gly Val Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly 
280 285 290 

CAC ACG GTC CTC GGC CTG GTC ACC GGC ACC GCG GTC AAC CAG GAC GGT 37172 
His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val Asn Gin Asp Gly 
295 300 305 
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GCC TCC AAC GGG CTG ACC GCG CCC AAC GGC CCA GCC CAG CAA CGC GTC 37220 
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala Gin Gin Arg Val 
310 315 320 

ATC GCC GAG GCG CTC GCC GAC GCC GGG CTG TCC CCG GAG GAC GTG GAC 37268 
lie Ala Glu Ala Leu Ala Asp Ala Gly Leu Ser Pro Glu Asp Val Asp 
325 330 335 340 

GCG GTC GAG GCG CAC GGC ACC GGC ACC CGG CTC GGC GAC CCC ATC GAG 37316 
Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu 
345 350 355 

GCC GGG GCG CTG CTC GCC GCC TCC GGA CGG AAC CGT TCC GGC GAC CAC 37364 
Ala Gly Ala Leu Leu Ala Ala Ser Gly Arg Asn Arg Ser Gly Asp His 
360 365 370 

CCG CTG TGG CTC GGC TCG CTG AAG TCC AAC ATC GGG CAT GCC CAG GCC 37412 
Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
375 380 385 

GCC GCC GGT GTC GGC GGC GTC ATC AAG ATG CTC CAG GCG CTG CGG CAC 37460 
Ala Ala Gly Val Gly Gly Val He Lys Met Leu Gin Ala Leu Arg His 
390 395 400 



GGC TTG CTG CCC CGC ACC CTC CAC GCC GAC GAG CCG ACC CCG CAT GCC 37508 
Gly Leu Leu Pro Arg Thr Leu His Ala Asp Glu Pro Thr Pro His Ala 
25 405 410 415 420 

GAC TGG AGC TCC GGC CGG GTA CGG CTG CTC ACC TCC GAG GTG CCG TGG 37556 
Asp Trp Ser Ser Gly Arg Val Arg Leu Leu Thr Ser Glu Val Pro Trp 
425 430 435 



CAG CGG ACC GGC CGG CCC CGG CGG ACC GGG GTG TCC GCC TTC GGC GTC 37604 
Gin Arg Thr Gly Arg Pro Arg Arg Thr Gly Val Ser Ala Phe Gly Val 
440 445 450 

GGC GGC ACC AAT GCC CAT GTC GTC CTC GAA GAG GCA CCC GCC CCG CCC 37652 
Gly Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Pro Pro 
455 460 465 



GCG CCG GAA CCG GCC GGG GAG GCC CCC GGC GGC TCC CGC GCC GCA GAA 37700 

Ala Pro Glu Pro Ala Gly Glu Ala Pro Gly Gly Ser Arg Ala Ala Glu 
40 470 475 480 

GGG GCG GAA GGG CCC CTG GCC TGG GTG GTC TCC GGA CGC GAC GAG CCG 37748 

Gly Ala Glu Gly Pro Leu Ala Trp Val Val Ser Gly Arg Asp Glu Pro 

485 490 495 500 



55 



GCC CTG CGG TCC CAG GCC CGG CGG CTC CGC GAC CAC CTC TCC CGC ACC 37796 
Ala Leu Arg Ser Gin Ala Arg Arg Leu Arg Asp His Leu Ser Arg Thr 
505 510 515 

CCC GGG GCC CGC CCG CGT GAC ATC GCC TTC TCC CTC GCC GCC ACG CGC 37844 
Pro Gly Ala Arg Pro Arg Asp He Ala Phe Ser Leu Ala Ala Thr Arg 
520 525 530 

GCA GCC TTT GAC CAC CGC GCC GTG CTG ATC GGC TCG GAC GGG GCC GAA 37892 
Ala Ala Phe Asp His Arg Ala Val Leu He Gly Ser Asp Gly Ala Glu 
535 540 545 
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CTC GCC GCC GCC CTG GAC GCG TTG GCC GAA GGA CGC GAC GGT CCG GCG 37940 
Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg Asp Gly Pro Ala 
550 555 560 

5 GTG GTG CGC GGA GTC CGC GAC CGG GAC GGC AGG ATG GCC TTC CTC TTC 37988 

Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met Ala Phe Leu Phe 
565 570 575 580 

ACC GGG CAG GGC AGC CAG CGC GCC GGG ATG GCC CAC GAC CTG CAT GCC 38036 
10 Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His Asp Leu His Ala 

585 590 595 

GCC CAT ACC TTC TTC GCG TCC GCC CTC GAC GAG GTG ACG GAC CGT CTC 38084 
Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val Thr Asp Arg Leu 
600 605 610 

GAC CCG CTG CTC GGC CGG CCG CTC GGC GCG CTG CTG GAC GCC CGA CCC 38132 
Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu Asp Ala Arg Pro 
615 620 625 

20 GGC TCG CCC GAA GCG GCA CTC CTG GAC CGG ACC GAG TAC ACC CAG CCG 38180 

Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu Tyr Thr Gin Pro 
630 635 640 
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GCG CTC TTC GCC GTC GAG GTG GCG CTC CAC CGG CTG CTG GAG CAC TGG 38228 
Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu Leu Glu His Trp 
645 650 655 660 

GGG ATG CGC CCC GAC CTG CTG CTG GGG CAC TCG GTG GGC GAA CTG GCG 38276 
Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val Gly Glu Leu Ala 
665 670 ** 675 

GCC GCC CAC GTC GCG GGT GTG CTC GAT CTC GAC GAC GCC TGC GCG CTG 38324 
Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp Ala Cys Ala Leu 
680 685 690 

GTG GCC GCC CGC GGC AGG CTG ATG CAG CGC CTG CCG CCC GGC GGC GCG 38372 
Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro Pro Gly Gly Ala 
695 700 705 



ATG GTC TCC GTG CGG GCC GGC GAG GAC GAG GTC CGC GCA CTG CTG GCC 38420 
Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg Ala Leu Leu Ala 
40 710 715 720 

GGC CGC GAG GAC GCC GTC TGC GTC GCC GCG GTG AAC GGC CCC CGG TCG 38468 
Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn Gly Pro Arg Ser 
725 730 735 * 740 



GTG GTG ATC TCC GGC GCG GAG GAA GCG GTG GCC GAG GCG GCG GCG CAG 38516 
Val Val lie Ser Gly Ala Glu Glu Ala Val Ala Glu Ala Ala Ala Gin 
745 750 755 

CTC GCC GGA CGA GGC CGC CGC ACC AGG CGG CTC CGC CTC GCG CAC GCC 38564 
Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg Val Ala His Ala 
760 765 770 



TTC CAC TCA CCC CTG ATG GAC GGC ATG CTC GCC GGA TTC CGG GAG GTC 38612 
Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly Phe Arg Glu Val 
55 775 780 785 
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GCC GCC GGC CTG CGC TAC CGG GAA CCG GAG CTG ACG GTC GTC TCC ACG 38660 
Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr Val Val Ser Thr 
790 795 800 

GTC ACG GGG CGG CCC GCC CGC CCC GGT GAA CTC ACC GGC CCC GAC TAC 38708 
Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr Gly Pro Asp Tyr 
805 810 815 ^ 820 

TGG GTG GCC CAG GTC CGT GAG CCC GTG CGC TTC GCG GAC GCG GTC CGC 38756 
Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala Asp Ala Val Arg 
825 830 835 

ACG GCA CAC CGC CTC GGA GCC CGC ACC TTC CTG GAG ACC GGC CCG GAC 38804 
Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu Thr Gly Pro Asp 
840 845 850 

GGC GTG CTG TGC GGC ATG GCA GAG GAG TGC CTG GAG GAC GAC ACC GTG 38852 
Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu Asp Asp Thr Val 
855 860 865 

GCC CTG CTG CCG GCG ATC CAC AAG CCC GGC ACC GCG CCG CAC GGT CCG 38900 
Ala Leu Leu Pro Ala lie His Lys Pro Gly Thr Ala Pro His Gly Pro 
870 875 880 



GCG GCT CCC GGC GCG CTG CGG GCG GCC GCC GCC GCG TAC GGC CGG GGC 38948 
Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala Tyr Gly Arg Gly 
25 885 890 895 900 

GCC CGG GTG GAC TGG GCC GGG ATG CAC GCC GAC GGC CCC GAG GGG CCG 38996 
Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly Pro Glu Gly Pro 
905 910 915 



GCC CGC CGC GTC GAA CTG CCC GTC CAC GCC TTC CGG CAC CGC CGC TAC 39044 
Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg His Arg Arg Tyr 
920 925 930 

TGG CTC GCC CCG GGC CGC GCG GCG GAC ACC GAC GAC TGG ATG TAC CGG 39092 
Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp Trp Met Tyr Arg 
935 940 945 



ATC GGC TGG GAC CGG CTG CCG GCT GTG ACC GGC GGG GCC CGG ACC GCC 39140 

lie Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly Ala Arg Thr Ala 

40 950 955 960 

GGC CGC TGG CTG GTG ATC CAC CCC GAC AGC CCG CGC TGC CGG GAG CTG 39188 

Gly Arg Trp Leu Val He His Pro Asp Ser Pro Arg Cys Arg Glu Leu 

965 970 975 980 



55 



TCC GGC CAC GCC GAA CGC GCG CTG CGC GCC GCG GGC GCG AGC CCC GTA 39236 
Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala Ser Pro Val 
985 990 995 

CCG CTG CCC GTG GAC GCT CCG GCC GCC GAC CGG GCG TCC TTC GCG GCA 39284 
Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala Ser Phe Ala Ala 
1000 1005 1010 

CTG CTG CGC TCC GCC ACC GGA CCT GAC ACA CGA GGT GAC AC A GCC GCG 3 9332 

Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly Asp Thr Ala Ala 
1015 1020 1025 
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CCC GTG GCC GGT GTG CTG TCG CTG CTG TCC GAG GAG GAT CGG CCC CAT 
Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu Asp Arg Pro His 
1030 1035 1040 

CGC CAG CAC GCC CCG GTA CCC GCC GGG GTC CTG GCG ACG CTG TCC CTG 
Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala Thr Leu Ser Leu 
1045 1050 1055 1060 

ATG CAG GCT ATG GAG GAG GAG GCG GTG GAG GCT CGC GTG TGG TGC GTC 
Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg Val Trp Cys Val 
1065 1070 1075 

TCC CGC GCC GCG GTC GCC GCC GCC GAC CGG GAA CGG CCC GTC GGC GCG 
Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg Pro Val Gly Ala 
1080 1085 1090 

GGC GCC GCC CTG TGG GGG CTG GGG CGG GTG GCC GCC CTG GAA CGC CCC 
Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Arg Pro 
1095 1100 1105 

ACC CGG TGG GGC GGT CTC GTG GAC CTG CCC GCC TCG CCC GGT GCG GCG 
Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser Pro Gly Ala Ala 
1110 1115 1120 

CAC TGG GCG GCC GCC GTG GAA CGG CTC GCC GGT CCC GAG GAC CAG ATC 
His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro Glu Asp Gin He 
H25 H30 1135 1140 

GCC GTG CGC GCG TCC GGC AGT TGG GGC CGG CGC CTC ACC AGG CTG CCG 
Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu Thr Arg Leu Pro 
1145 H50 H55 

CGC GAC GGC GGC GGC CGG ACG GCC GCA CCC GCG TAC CGG CCG CGC GGC 
Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr Arg Pro Arg Gly 
1160 H65 H70 

ACG GTG CTC GTC ACC GGT GGC ACC GGC GCG CTC GGC GGG CAT CTC GCC 
Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His Leu Ala 
1175 H80 H85 

CGC TGG CTC GCC GCG GCG GGC GCC GAA CAC CTG GCG CTC ACC AGC CGC 
Arg Trp Leu Ala Ala Ala Gly Ala Glu His Leu Ala Leu Thr Ser Arg 
11^0 H95 1200 

CGG GGC CCG GAC GCG CCC GGC GCC GCC GGA CTC GAG GCC GAA CTC CTC 
Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu Ala Glu Leu Leu 
1205 1210 1215 1220 

CTC CTG GGC GCC AAG GTG ACG TTC GCC GCC TGC GAC ACC GCC GAC CGC 
Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp Thr Ala Asp Arg 
1225 1230 1235 

GAC GGC CTC GCC CGG GTC CTG CGG GCG ATA CCG GAG GAC ACC CCG CTC 
Asp Gly Leu Ala Arg Val Leu Arg Ala He Pro Glu Asp Thr Pro Leu 
1240 1245 1250 

ACC GCG GTG TTC CAC GCC GCG GGC GTA CCG CAG GTC ACG CCG CTG TCC 
Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val Thr Pro Leu Ser 
1255 1260 1265 
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CGT ACC TCG CCC GAG CAC TTC GCC GAC GTG TAC GCG GGC AAG GCG GCG 
Arg Thr Ser Pro Glu His Phe Ala Asp Val Tyr Ala Gly Lys Ala Ala 
1270 1275 1280 



40100 



GGC GCC GCG CAC CTG GAC GAA CTG ACC CGC GAA CTC GGC GCC GGA CTC 
Gly Ala Ala His Leu Asp Glu Leu Thr Arg Glu Leu Gly Ala Gly Leu 
1285 1290 1295 1300 



40148 



10 



GAC GCG TTC GTC CTC TAC TCC TCC GGC GCC GGC GTC TGG GGC AGC GCC 
Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val Trp Gly Ser Ala 
1305 1310 1315 



40196 



15 



GGC CAG GGT GCC TAC GCC GCC GCC AAC GCC GCC CTG GAC GCG CTC GCC 4 0244 

Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1320 1325 1330 

CGG CGC CGT GCG GCG GAC GGA CTC CCC GCC ACC TCC ATC GCC TGG GGC 40292 
Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser lie Ala Trp Gly 
1335 1340 1345 



20 



GTG TGG GGC GGC GGC GGT ATG GGG GCC GAC GAG GCG GGC GCG GAG TAT 
Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala Gly Ala Glu Tyr 
1350 1355 1360 



40340 



25 



CTG GGC CGG CGC GGT ATG CGC CCC ATG GCA CCG GTC TCC GCG CTC CGG 
Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val Ser Ala Leu Arg 
1365 1370 1375 1380 



40388 



30 



GCG ATG GCC ACC GCC ATC GCC TCC GGG GAA CCC TGC CCC ACC GTC ACC 4043 6 

Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Cys Pro Thr Val Thr 
1385 1390 1395 

CAC ACC GAC TGG GAG CGC TTC GGC GAG GGC TTC ACC GCC TTC CGG CCC 40484 
His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr Ala Phe Arg Pro 
1400 1405 1410 



35 



AGC CCT CTG ATC GCG GGG CTC GGC ACG CCG GGC GGC GGC CGG GCG GCG 
Ser Pro Leu He Ala Gly Leu Gly Thr Pro Gly Gly Gly Arg Ala Ala 
1415 1420 1425 



40532 



40 



GAG ACC CCC GAG GAG GGG AAC GCC ACC GCT GCG GCG GAC CTC ACC GCC 
Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala Asp Leu Thr Ala 
1430 1435 1440 



40580 



45 



CTG CCG CCC GCC GAA CTC CGC ACC GCG CTG CGC GAG CTG GTG CGA GCC 40628 
Leu Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu Leu Val Arg Ala 
1445 1450 1455 ~ 1460 

CGG ACC GCC GCG GCG CTC GGC CTC GAC Gkk CCG GCC GAG GTC GCC GAG 40676 
Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala Glu Val Ala Glu 
1465 1470 1475 



50 



GGC GAA CGG TTC CCC GCC ATG GGC TTC GAC TCC CTG GCC ACC GTA CGG 
Gly Glu Arg Phe Pro Ala Met Gly Phe Asp Ser Leu Ala Thr Val Arg 
1480 1485 1490 



40724 



55 



CTG CGC CGC GGA CTC GCC TCG GCC ACG GGC CTC GAC CTG CCC CCC GAT 
Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp Leu Pro Pro Asp 
1495 1500 1505 
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CTG CTC TTC GAC CGG GAC ACC CCG GCC GCG CTC GCC GCC CAC CTG GCC 
Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala Ala His Leu Ala 
1510 1515 1520 

GAA CTG CTC GCC ACC GCA CGG GAC CAC GGA CCC GGC GGC CCC GGG ACC 
Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly Gly Pro Gly Thr 
1525 1530 1535 ' 1540 

GGT GCC GCG CCG GCC GAT GCC GGA AGC GGC CTG CCG GCC CTC TAC CGG 
Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro Ala Leu Tyr Arg 
1545 1550 1555 

GAG GCC GTC CGC ACC GGC CGG GCC GCG GAA ATG GCC GAA CTG CTC GCC 
Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala Glu Leu Leu Ala 
1560 1565 1570 

GCC GCT TCC CGG TTC CGC CCC GCC TTC GGG ACG GCG GAC CGG CAG CCG 
Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala Asp Arg Gin Pro 
1575 1580 1585 

GTG GCC CTC GTG CCG CTG GCC GAC GGC GCG GAG GAC ACC GGG CTC CCG 
Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asp Thr Gly Leu Pro 
1590 1595 1600 

CTG CTC GTG GGC TGC GCC GGG ACG GCG GTG GCC TCC GGC CCG GTG GAG 
Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser Gly Pro Val Glu 
1605 1610 1615 1620 

TTC ACC GCC TTC GCC GGA GCG CTG GCG GAC CTC CCG GCG GCG GCC CCG 
Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro Ala Ala Ala Pro 
1625 1630 1635 

ATG GCC GCG CTG CCG CAG CCC GGC TTT CTG CCG GGA GAA CGA GTC CCG 
Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly Glu Arg Val Pro 
1640 1645 ' 1650 

GCC ACC CCG GAG GCA TTG TTC GAG GCC CAG GCG GAA GCG CTG CTG CGC 
Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu Ala Leu Leu Arg 
1655 1660 1665 

TAC GCG GCC GGC CGG CCC TTC GTG CTG CTG GGG CAC TCC GCC GGC GCC 
Tyr Ala Ala Gly Arg Pro Phe Val Leu Leu Gly His Ser Ala Gly Ala 
1670 1675 1680 

AAC ATG GCC CAC GCC CTG ACC CGT CAT CTG GAG GCG AAC GGT GGC GGC 
Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala Asn Gly Gly Gly 
1685 1690 1695 " 1700 

CCC GCA GGG CTG GTG CTC ATG GAC ATC TAC ACC CCC GCC GAC CCC GGC 
Pro Ala Gly Leu Val Leu Met Asp He Tyr Thr Pro Ala Asp Pro Gly 
1705 1710 1715 

GCG ATG GGC GTC TGG CGG AAC GAC ATG TTC CAG TGG GTC TGG CGG CGC 
Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp Val Trp Arg Arg 
1720 1725 1730 

TCG GAC ATC CCC CCG GAC GAC CAC CGC CTC ACG GCC ATG GGC GCC TAC 
Ser Asp He Pro Pro Asp Asp His Arg Leu Thr Ala Met Gly Ala Tyr 
1735 1740 1745 
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CAC CGG CTG CTT CTC GAC TGG TCG CCC ACC CCC GTC CGC GCC CCC GTA 41540 
His Arg Leu Leu Leu Asp Trp Ser Pro Thr Pro Val Arg Ala Pro Val 
1750 1755 1760 

5 CTG CAT CTG CGC GCC GCG GAA CCC ATG GGC GAC TGG CCA CCC GGG GAC 41588 
Leu His Leu Arg Ala Ala Glu Pro Met Gly Asp Trp Pro Pro Gly Asp 
1765 1770 1775 1780 

ACC GGC TGG CAG TCC CAC TGG GAC GGC GCG CAC ACC ACC GCC GGC ATC 41636 
1Q Thr Gly Trp Gin Ser His Trp Asp Gly Ala His Thr Thr Ala Gly He 

1785 1790 1795 

CCC GGA AAC CAC TTC ACG ATG ATG ACC GAA CAC GCC TCC GCC GCC GCC 41684 
Pro Gly Asn His Phe Thr Met Met Thr Glu His Ala Ser Ala Ala Ala 
1800 1805 1810 

CGG CTC GTG CAC GGC TGG CTC GCG GAA CGG ACC CCG TCC GGG CAG GGC 41732 
Arg Leu Val His Gly Trp Leu Ala Glu Arg Thr Pro Ser Gly Gin Gly 
1815 1820 1825 

GGG TCA CCG TCC CGC GCG GCG GGG AGA GAG GAG AGG CCG TGA 41774 
Gly Ser Pro Ser Arg Ala Ala Gly Arg Glu Glu Arg Pro * 
1830 1835 1840 

ACACGGCAGC CGGCCCGACC GGCACCGCCG CCGGCGGCAC CACCGCCCCG GCGGCGGCAC 41834 

ACGACCTGTC CCGCGCCGGA CGCAGGCTCC AACTCACCCG GGCCGCACAG TGGTTCGCCG 41894 

GCAACCAGGG AGACCCCTAC GGGATGATCC TGCGCGCCGG CACCGCCGAC CCGGCACCGT 41954 

ACGAGGAAGA GATCCGTGAG CGGGGGCCGC TGTTCCACAG CGAACTCCTC GGCGCCTGGG 42014 

TGACCGGCAG CCGCCATGTC GCCGACGCCG TGACGGCCGA CGACGCGTTC GGCGCCCTCA 42074 

CCGCGGACGG TGCACGGCCA GGAGTCCGCG AACTGCCGCT CTCCGGCAGC GCCCTCGACG 42134 

CCGCCCACGG GAACCCCGGC GGCCCGCCCC TCCCCGGAGG GTGGCCGCAC CGGCCCCCGG 42194 

ACAGGGAGGA GCGAGACGAC CCGGACCGGC ACGCGGCGGA CCTGCTGAAC GCCGCCGGCC 42254 

CGGGGCAGGT CCTCGACCTC GTCCCGTTCG CCCGGCGGCT GGCGGCCCGG ACGGCCGGCG 42314 

40 CGTGGCTGGG CGTCCCGGCG GAACGGCTGC CGCGCTTCGA GACGGCACTC ACCGGCTGCC 42374 

GCCGCGCCCT CGACGCCCTG CTCTGCCCCC AGCTCCTGGC CGACGCGCGG GCCGGACTGG 42434 

CCGCCGAGGA GGCCCTGCGC GCCGTGCTCG GCGAGACCCC GGkGGC ACGC GGACGTCCGC 42494 

CCGGCGCGGT CGAGGCGGCC CGCGCGCACG CCGTCAGCGC GGCGGAGCCC ATCGCCGTCC 42554 

TGCTGTGCAA CGCGGTGCGG GAACTGATGG AACGGCCGGC CCAGTGGCGG GCGCTCACCG 42614 

CCGACCCCGG CCTGGCGGGC GCCGCGATCA CCGAAACACT GCTCTGGGCA CCGCCGGTGC 42674 

GCCTGGAGAG CAGGGTGGCA CGCGAGACGG CCGTACTCGC CGGGCGGACG CTGCCCGCTG 42734 

GAACCCATCT CGTCGTCCTC GCCGCCGCCG CCAACCGCGA CGCCTGCCGG AACGCCGGTC 42794 

55 CGGCCGTCAC CGGCTTCGAC GTCCTCCGCC GCGCCTCGGA CGGCGGCCCC CAGCCCCACG 42854 
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Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Val Asp Ser Ala 
180 185 190 

Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser Leu Arg 
195 200 205 

Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu lie Leu 
210 215 220 

Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu Ser Pro 
225 230 235 240 

Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg 
245 250 255 

Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala Ala Leu 
260 265 270 

Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly Gly Ala Val Asn 
275 280 285 

Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin 
290 295 300 

Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly Val Ser Thr Gly 
305 310 315 320 

Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala Gly Asp 
325 330 335 

Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly Ala Asp 
340 345 350 

Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr Asn Val 
355 360 365 

Gly His Leu Glu Gly Ala Ala Gly He Val Gly Leu lie Lys Ala Thr 
370 375 380 

Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe Ser Thr 
385 390 395 400 

Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu Arg Val Gin Thr 
405 410 ~ 415 

Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val Ala Gly 
420 425 " 430 

Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val He Ala 
435 440 445 

Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser Gly Ala 
450 455 460 

Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val Val Pro Val Val 
465 470 475 480 



Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu 
485 490 495 
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Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val 
500 505 ' 510 

Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 
515 520 525 

Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly. 
530 535 540 

Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu Asp Gly Gly Val 
545 550 555 ' 560 

Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
565 570 575 

Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly 
580 585 590 

Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser 
595 600 605 

Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val 
610 615 620 

His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu 
625 630 635 640 

Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser 
645 650 655 

Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val 
660 665 670 

Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu 
675 680 685 

Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu lie 
690 695 700 

Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro 
705 710 715 720 

Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val 
725 730 * 735 

Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg 
740 745 750 

Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala 
755 760 765 

Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp 
770 775 780 



Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser 
78 * 790 795 800 

Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu 
805 810 815 
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Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val 
820 825 830 

Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg 
835 840 845 

Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin 
850 855 860 

Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg 
865 870 875 880 

Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala 
885 890 895 

Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly 
900 905 910 

His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp 
915 920 925 

Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin Glu Lys 
930 935 940 

Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser Ser Gin 
945 950 955 960 

Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr Ala Val 
965 970 975 

Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu Ser Phe 
980 985 990 

Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg Asn Arg 
995 1000 1005 

Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val Tyr Asp 
1010 1015 1020 

Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu Leu Leu 
1025 1030 1035 1040 

Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys Ser Phe 
1045 1050 1055 

Glu Ala Gin Glu Pro He Ala Val Val Gly Met Gly Cys Arg Phe Pro 
1060 1065 1070 

Gly Gly Val Gly Ser Pro Glu Ala Leu Trp Arg Leu Val Val Glu Gly 
1075 1080 1085 

Val Asp Ala Val Ser Pro Phe Pro Gly Asp Arg Gly Trp Asp Val Glu 
1090 1095 1100 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Ala Gly Lys Ser Tyr Val Arg 
1105 1110 1115 1120 

Glu Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe 
1125 1130 1135 
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Gly lie Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
1140 1145 H50 

Leu Leu Glu Thr Ser Trp Glu Ala He Glu Arg Ala Gly He Asp Pro 
1155 H60 H65 

His Ser Leu His Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Pro 
1170 H75 H80 

Gin Glu Tyr Gly Pro Arg Leu Ala Glu Gly Ala Glu Gly Ser Asp Gly 

H90 1195 1200 

Tyr Leu Leu Thr Gly Thr Ser Gly Ser Val Val Ser Gly Arg Val Ala 
1205 1210 ' 1215 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg Gly 
1235 1240 1245 

Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Gly 
1250 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
2265 1270 1275 1280 

Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala 
1285 1290 1295 

Glu Gly Ala Gly Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
1300 1305 1310 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 
1315 1320 1325 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu 
1330 1335 1340 

Arg Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 
i345 1350 1355 1360 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
1365 1370 1375 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
1380 1385 1390 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
1395 1400 1405 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met 
1410 1415 1420 

Arg Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 
I 425 1430 1435 1440 

His Val Asp Trp Ser Ala Gly Gly Val Trp Leu Leu Thr Glu Ala Arg 
1445 1450 1455 
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Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
1460 1465 1470 

Gly Val Ser Gly Thr Asn Ala His Leu He Leu Glu Ala Pro Asp Thr 
1475 1480 1485 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 
1490 1495 1500 

Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
1505 1510 1515 1520 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
1525 1530 1535 

Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
1540 1545 1550 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
1555 1560 1565 

Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp 
1570 1575 1580 

Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
1585 1590 1595 1600 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
1605 1610 1615 

Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
1620 1625 1630 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Glu Trp 
1635 1640 1645 

Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp 
1650 1655 1660 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
1665 1670 1675 1680 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin 
1685 1690 1695 

Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp 
1700 1705 1710 

Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu 
1715 1720 1725 

Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val 
1730 1735 1740 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
1745 1750 1755 1760 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
1765 1770 1775 
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Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu lie Pro 
1780 1785 1790 

Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu 
1795 1800 1805 

Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro 
1810 1815 1820 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
1825 1830 1835 1840 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
1845 1850 1855 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val 
I860 1865 1870 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
1875 1880 1885 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
1890 1895 1900 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
1^05 1910 1915 1920 

Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr 
1925 1930 1935 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
1940 1945 1950 

Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 
1955 I960 1965 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Ser Ser Pro Val Arg Pro 
1970 1975 1980 

His Ser He Gly Arg Cys Leu Leu Val Ala Pro Pro Thr Thr Asp Gly 
1985 1990 1995 2000 

Glu Leu Leu Asp Gly Leu Thr Thr Val Leu Ser Glu Arg Gly Ala Ser 
2005 2010 2015 

Val Ala Arg Leu Glu Val Pro He Gly Ala Arg Arg Ala Glu Val Ala 
2020 2025 2030 

Glu Leu Leu Lys Pro Ser Met Glu Ser Ala Gly Glu Glu Asn Thr Thr 
2035 2040 * 2045 

Val Val Ser Leu Leu Gly Leu Val Pro Ser Thr Asp Ala Val Arg Thr 
2050 2055 2060 

Ser He Ala Leu Leu Gin Ala Val Ser Asp He Gly Val Pro Ala Ala 
2065 2070 2075 " 2080 

Arg Val Trp Ala Leu Thr Arg Arg Ala Val Ala Val Val Pro Gly Glu 
2085 2090 2095 
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Thr Pro Gin Asp Ala Gly Ala Gin Leu Trp Gly Phe Gly Arg Val Ala 
2100 2105 2110 

Ala Leu Glu Leu Pro Asp He Trp Gly Gly Leu He Asp Leu Pro Glu 
2115 2120 2125 

Thr Ala Glu Leu Thr Arg Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr 
2130 2135 2140 

Pro Glu Arg Leu Pro Gin Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala 
2145 2150 2155 2160 

Ala Ala Val Leu Ala Gly Arg Asp Gly Glu Asp Gin Val Ala Val Arg 
2165 2170 2175 

Ala Ser Gly He Tyr Gly Arg Arg Val Ser Arg Ala Ala Ala Ala Gly 
2180 2185 2190 

Ala Ala Ser Trp Gin Pro Ser Gly Thr Val Leu He Thr Gly Gly Met 
2195 2200 2205 

Gly Ala He Gly Arg Arg Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala 
2210 2215 2220 

Glu Arg Leu Val Leu Thr Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala 
2225 2230 2235 2240 

Ala Glu Leu Ala Glu Glu Leu Arg Gly His Gly Cys Glu Val Val His 
2245 2250 2255 

Ala Ala Cys Asp Val Ala Glu Arg Asp Ala Leu Ala Ala Leu Val Thr 
2260 2265 2270 

Ala Tyr Pro Pro Asn Ala Val Phe His Thr Ala Gly He Leu Asp Asp 
2275 2280 2285 

Ala Val He Asp Thr Leu Ser Pro Glu Ser Phe Glu Thr Val Arg Gly 
2290 2295 2300 

Ala Lys Val Cys Gly Ala Glu Leu Leu His Gin Leu Thr Ala Asp He 
2305 2310 2315 2320 

Lys Gly Leu Asp Ala Phe Val Leu Phe Ser Ser Val Thr Gly Thr Trp 
2325 2330 2335 

Gly Asn Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2340 2345 2350 

Ala Leu Ala Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val 
2355 2360 2365 

Ala Trp Gly Leu Trp Gly Gly Gly Gly Met Ala Ala Gly Ala Gly Glu 
2370 2375 2380 

Glu Ser Leu Ser Arg Arg Gly Leu Arg Ala Met Asp Pro Asp Ala Ala 
2385 2390 2395 2400 

Val Asp Ala Leu Leu Gly Ala Met Gly Arg Asn Asp Val Cys Val Thr 
2405 2410 2415 
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Val Val Asp Val Asp Trp Glu Arg Phe Ala Pro Ala Thr Asn Ala lie 
2420 2425 2430 

Arg Pro Gly Arg Leu Phe Asp Thr Val Pro Glu Ala Arg Glu Ala Leu 
2435 2440 2445 

Thr Ala Ala Gly Thr Thr Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu 
2450 2455 2460 

Ala Arg Arg Leu Ser Met Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu 
2465 2470 2475 2480 

Val Glu Leu Val Arg Thr Glu Ala Ala Phe Val Leu Arg His Pro Asn 
2485 2490 2495 

Thr Asp Ala lie Gly Ala Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp 
2500 2505 2510 

Ser Leu Thr Ser Leu Glu Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly 
2515 2520 2525 

Leu Lys Leu Pro Ala Thr Val He Phe Asp His Pro Ser Pro Thr Ala 
2530 2535 2540 

Leu Ala Arg Leu Leu Leu Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala 
2545 2550 2555 2560 

Pro Ala Ala Asp Glu Pro Pro Leu Pro Val Ala Val Ala Asp Asp Asp 
2565 2570 2575 

Pro Val Val He Val Gly Met Ala Cys Arg Phe Pro Gly Gly Ala Glv 
2580 2585 2590 

Thr Pro Glu Ala Leu Trp Lys Leu Val Thr Glu Glu Arg Asp Val He 
2595 2600 2605 

Gly Ala Ala Pro Thr Asp Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp 
26 10 2615 2620 

Pro Glu Pro Gly Val Ala Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe 
2625 2630 2635 2640 

Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro 
2645 2650 2655 

Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr 
2660 2665 2670 

Ser Trp Glu Ala He Glu Arg Ala Gly He Asp Pro His Ser Leu His 
2675 2680 2685 

Gly ™* rB ^ Gly Val lyr Val G1 y Leu His Gin Glu Tyr Ala 
2690 2695 2700 

^L** 9 LeU His Glu Ala Pro 01x1 Glu ^ Q l u G ly Tyr Leu Leu Thr 
2705 27 1° 2715 2720 

Gly Lys Ser Ala Ser Val Val Ser Gly Arg He Ser Tyr Thr Leu Gly 
2725 2 730 2 735 
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Leu Glu Gly Pro Ser Leu Ser He Asp Thr Ala Cys Ser Ser Ser Leu 
2740 2745 2750 

Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Gly Gly Glu Cys Asp 
2755 2760 2765 

Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Ala Pro Gly Leu Phe 
2770 2775 2780 

Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys 
2785 2790 2795 2800 

Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly 
2805 2810 2815 

Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro 
2820 2825 2830 

Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 
2835 2840 2845 

Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val He Arg 
2850 2855 2860 

Gin Ala Leu Ala Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val 
2865 2870 2875 2880 

Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin 
2885 2890 2895 

Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Ala Glu Cys Pro Val 
2900 2905 2910 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala 
2915 2920 2925 

Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg Tyr Gly Trp 
2930 2935 2940 

Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp 
2945 2950 2955 2960 

Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly 
2965 2970 " 2975 

Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly 
2980 2985 2990 

Thr Asn Ala His Leu He Leu Glu Ala Pro Glu Ala Leu Glu Ala Leu 
2995 3000 3005 

Glu Ala Thr Asp Ala Pro Glu Ala Pro Glu Ala Pro Glu Ala Pro Asp 
3010 3015 3020 

Val Thr Asp Val Thr Glu Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala 
3025 3030 3035 3040 

Glu Gly Ala Lys Ala Pro Gly Ser Pro Glu Glu Ala Gin Pro Ala Val 
3045 3050 3055 
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Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg 
3060 3 065 3070 

Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly 
5 3075 3080 3085 

Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr 
3090 3 095 3100 

10 A **g Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu 

3105 3110 3115 3120 

Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala 
3125 3130 " 3135 

1 * Val Val As P Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val 

3140 3145 3150 

Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu 
3155 3160 3165 

Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala 
3170 3175 3180 

Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val Ser Gly Gly 
3185 3190 3195 3200 

Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val 
3205 3210 3215 

Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala 
30 3220 3225 3230 

Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val Ala 
3235 3240 3245 

3S G1 y Ala Leu Se * Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala 

3250 3255 3260 

Gly Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val 
3265 3270 3275 3280 

40 Pro Leu p ro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly 

3285 3290 3295 

Val Gin Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly 
3300 3305 3310 

45 

Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly 
3315 3320 3325 

Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His 
sq 3330 3335 3340 

Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arc 
3345 3350 3355 3360 

Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin 
55 3365 3370 3375 
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Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg 
3380 3385 3390 

Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Gin Gly 
3395 3400 3405 

His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala 
3410 3415 3420 

He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly 
3425 3430 3435 3440 

Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr 
3445 3450 3455 

Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu 
3460 3465 3470 

Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His 
3475 3480 3485 

His Tyr Trp Ala Val Thr Ser Pro Ala Gly Val Gly Asp Ala Ala Ala 
3490 3495 3500 

Gly Arg Phe Gly Met Thr Trp Glu Asp His Pro Phe Leu Arg Gly Gly 
3505 3510 3515 3520 

Leu Pro Leu Ala Asp Ser Gly Glu Arg Val Phe Ala Gly Arg Leu Ala 
3525 3530 3535 

Gly Ser Glu His Asp Trp Leu Thr Asp His Ala Val Ser Gly Val Thr 
3540 3545 3550 

Leu Leu Pro Gly Thr Ala Phe Val Glu Phe Ala Leu His Ala Gly Ala 
3555 3560 3565 

Ala Thr Gly Cys Gly Arg Leu Glu Glu Leu Ser Val Glu Ala Pro Leu 
3570 3575 3580 

Val Leu Pro Ala Ala Gly Gly Val Arg Val Gin Met Arg Val Ser Ala 
3585 3590 3595 3600 

Ala Asp Glu Ser Gly Arg Arg Arg Val Ala He His Ser Ala Pro Glu 
3605 3610 3615 

Ala Ala Val His Ser Ala Ala Glu Gly Gly Asp Ser Ala Gly Val Trp 
3620 3625 3630 

Thr Arg His Gly Glu Gly Thr Leu Val Pro Asp Pro Glu Pro Thr Pro 
3635 3640 3645 

Pro Asp Ala Asp Trp Ala Arg Ala Trp Pro Pro Ala Gly Glu Arg Val 
3650 3655 3660 

Glu Pro Ala Glu Leu Tyr Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr 
3665 3670 3675 3680 

Gly Glu Ala Phe Ala Gly Val Arg Ala Val Trp Arg Gin Pro Asp Ala 
3685 3690 3695 
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20 



Leu Leu Ala Glu Val Leu Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly 
3700 3705 3710 

Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp 
3715 3720 3725 

He Ala Gly Gly Leu Leu Glu Val Pro Glu Asp Ala Val Leu Leu Pro 
3730 3735 3740 

Phe Ala Trp Gin Gly Val Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu 
3745 3750 3755 3760 

Arg Val Arg Leu Thr Lys Ala Gly Asp Gly Ala Val Ser Leu Gin Ala 
3765 3770 3775 

Ala Asp Thr Ser Gly Ala Ala Val Leu Ser Leu Gly Ala Leu Val Met 
3780 3785 3790 

Arg Pro Leu Ala Arg Arg Lys Leu Asp Val Leu Leu Gly Thr Asp Ala 
3795 3800 3805 

Gly Giu Arg Ser Leu Tyr Arg Val Glu Trp Gin Pro Arg Leu Leu Pro 
3810 3815 3820 

Ala Gly Pro Pro Arg Ser Trp Ala Val Leu Gly Pro Asp Ala Asp Arg 
3825 3830 3835 & 3840 

Leu Ala Gly Thr Pro Gly Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr 
3845 3850 " 3855 

Ala Leu Tyr Pro Glu Val Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly 
3860 3865 3870 

Ala Pro Arg Pro Glu Ala Val Val Leu Pro Val Leu Ser Gly Ala Gly 
3875 3880 3885 

Ala Thr Pro Glu Ser Val Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala 
35 3890 3895 3900 

Leu Gin Asp Trp Leu Asp Ala Glu Glu Leu Val Asp Thr Pro Leu He 
3905 3910 3915 3920 

40 Val Leu Thr Arg Gly Ala Val Ala Ala Val Pro Gly Glu Glu He Gly 

3925 3930 3935 

Asp Leu Ala Cys Ala Gly Val Trp Gly Leu Val Arg Ser Ala Arg Ser 
3940 3945 3950 



25 



30 



45 



50 



55 



Glu His Pro Gly Arg Phe Ala Leu Val Asp Thr Asp Gly His Pro Asp 
3955 3960 3965 

Asp Arg Thr Ala Leu Pro Leu Ala Leu Arg Ala Val Leu Asp Gly Ala 
3970 3975 3980 

Gly Gin Leu Ser Leu Arg Ala Gly Thr Ala Arg Thr Pro Val Leu Leu 
3985 3990 3995 4000 

Arg Ala Gly Thr Pro Glu Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala 
4005 4010 4015 
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Gly Thr Val Leu Val Thr Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu 
4020 4025 4030 

Ala Arg His Leu Ala Ala Glu His Gly Val Arg His Leu Leu Leu Leu 
4035 4040 4045 

Ser Arg Gly Gly Arg Ala Ala Glu Gly Ala Asp Glu Leu Ala Ala Glu 
4050 4055 4060 

Leu Ala Gly Leu Glu Ala Glu Pro Cys Phe Ala Ala Cys Asp Ala Ala 
4065 4070 4075 4080 

Asp Arg Glu Ala Leu Ala Arg Val Leu Ala Glu Val Pro Ala Asp Arg 
4085 4090 4095 

Pro Leu Thr Gly Val lie His Ala Ala Gly Val Leu Asp Asp Gly Thr 
4100 4105 4110 

Leu Asp Ala Leu Thr Pro Glu Arg lie Gly Thr Val Met Arg Pro Lys 
4115 4120 4125 

Ala Asp Ala Ala Leu Asn Leu His Glu Leu Thr Arg Thr Ser Pro Leu 
4130 4135 4140 

Ser Val Phe Ala Val Phe Ser Gly Ala Ala Gly He Leu Gly Arg Pro 
4145 4150 4155 4160 

Gly Gin Ala Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala 
4165 4170 4175 

Gin His Arg Arg Ala His Gly Leu Pro Ala Val Ser Leu Ala Trp Gly 
4180 4185 4190 

Leu Trp Gly Gly Ala Thr Gly Met Thr Gly His Leu Ser Gly Thr Asp 
4195 4200 4205 

Leu Arg Arg Met Arg Arg Ser Gly He Ala Pro Met Thr His Asp Gin 
4210 4215 4220 

Gly Leu Ala Leu Phe Asp Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro 
4225 4230 4235 4240 

Leu Leu Val Pro Met Arg Leu Asp Leu Ala Ala Leu Val Arg Glu Arg 
4245 4250 4255 

Ala Glu His Gly Pro Asp Ala Val Pro Gly Pro Leu Leu Gly Leu Leu 
4260 4265 4270 

Pro Ala Arg Ala Ala Val Arg Gin Ala Ala Ala Pro Val Arg Gly Gly 
4275 4280 4285 

Ala Pro Ala Pro Ala Gly Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly 
4290 4295 4300 

Leu Gly Glu Glu Ala Arg Leu Arg Glu Leu Val Arg Leu Val Arg Ala 
4305 4310 4315 4320 

Glu Val Ser Gly Val Leu Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro 
4325 4330 4335 
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Gly Arg Pro Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
4340 4345 4350 

Leu Arg Asn Arg Leu Gly Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala 
4355 4360 4365 

Leu Val Phe Asp Arg Pro Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala 
4370 4375 4380 

Ala Glu Leu Ala Gly Pro Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala 
4385 4390 4395 4400 

Phe Glu Gly Leu Glu Ala Leu Ala Ala Ala Val Gly Ala Leu Ala Glu 
4405 4410 4415 

Asp Asp Leu Arg Arg Asp Val Leu Arg Arg Arg Leu Thr Glu Leu Ala 
4420 4425 4430 

Ala Ala Leu Thr Pro Gin Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala 
4435 4440 4445 

Pro Ser Asp Leu Asp Glu Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu 
4450 4455 4460 

Phe Ala Phe lie Glu Glu Gin Leu * 
4465 4470 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1865 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg Leu Gly Ala Leu Glu 
1 5 10 15 

Glu Arg Ala Arg Glu Pro He Ala Val Val Ala Met Ser Cys Arg Tyr 
20 25 30 

Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Trp Arg Leu Leu Ala Asp 
35 40 45 

Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp Arg Gly Trp Asp Leu 
50 55 60 

Asp Ala Leu Tyr Asp Pro Asp Gly Gly Pro Gly Thr Ser Tyr Ala Arg 
65 70 75 80 

Glu Gly Gly Phe Leu Ser His Cys Ala Gly Phe Asp Ala Glu Phe Phe 
85 90 95 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
100 105 HO 
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Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Val Thr Ala 
115 120 125 

Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Tyr 
130 135 140 

Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala Gly Ala Gly Pro Pro 
145 150 155 160 

Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser Ala Gly Ser lie Ala 
165 170 175 

Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg Gly Pro Ala Val Thr 
180 185 190 

Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser Leu His Leu Ala Val 
195 200 205 

Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Ala 
210 215 220 

Thr Val Leu Ser Thr Pro Thr Val Leu Val Asp Phe Ser Arg Gin Arg 
225 230 235 240 

Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Asp Ser Ala Asp 
245 250 255 

Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu Leu Leu Gin Arg Leu 
260 265 270 

Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu Ala Val He Arg Gly 
275 280 285 

Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn 
290 295 300 

Gly Arg Ala Gin Arg Gin Val He Glu Asp Ala Leu Arg Asp Ala Gly 
305 ~ 310 315 320 

Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
325 330 335 

Glu Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly 
340 345 350 

Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser 
355 360 365 

Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys 
370 375 380 

Met Val Leu Ala Met Arg His Gly Arg Leu Pro Arg Thr Leu His Val 
385 390 395 400 

Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys Gly Gly Val Arg Leu 
405 410 415 



Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala Gly Glu Pro Arg Arg 
420 425 430 
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Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr Asn Ala His Val Val 
435 440 445 

Leu Glu Ser Val Pro Ala Gly Glu Pro Pro Ala Ala Gly Arg Pro Glu 
450 455 460 

Asp Thr Gly Gly Ala Trp Thr Val Ser Gly Arg Gly Pro Ala Ala Leu 
465 470 475 480 

Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu Thr Gly Thr Gly Thr 
485 490 " 495 

Gly Thr Gly Gin Gly Ala Gly Gin Gly Ala Gly Pro Gly Thr Ala Glu 
500 505 510 

Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala Phe Arg His Arg Ala 
515 520 525 

Val Val Leu Gly Gly Asn Arg Ala Glu Leu Leu Ala Gly Leu Arg Glu 
530 535 540 

Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val Val Thr Gly Thr Ala 
545 550 555 560 

Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser 
565 570 575 

Gin Arg Ala Gly Ser Gly Arg Gly Leu Tyr Arg Arg His Pro Val Phe 
580 585 590 

Ala Arg Ala Leu Asp Glu Val Cys Ala Ala Leu Glu Pro His Leu His 
595 600 605 

Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro Gly Ser Pro Glu Ala 
610 615 620 

Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro Ala Leu Phe Ala Leu 
625 630 635 640 

Gin Thr Ala Leu Phe Arg Leu Ala Glu His His Gly Leu Arg Ala Glu 
645 650 655 

Ala Leu Cys Gly His Ser Val Gly Glu He Ala Ala Ala His Ala Ala 
660 665 670 

Gly Val Leu Thr Leu Pro Asp Ala Ala Arg Leu Val Ala Ala Arg Gly 
675 680 685 

Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala Met Ala Ala Leu Arg 
690 695 700 

Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu Arg Arg Ala Gly Glu 
705 710 715 720 

Leu Ala Leu Ala Ala Val Asn Gly Pro Ser Ser Val Val Val Ser Gly 
725 730 735 



Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin Trp Arg Ala Glu Gly 
740 745 ' 750 
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Arg Glu Ala Arg Arg Leu Ala Val Ser His Ala Phe His Ser Pro Arg 
755 ~ " 760 765 

Met Asp Gly Met Leu Thr Gin Phe Asp Arg Val Ala Arg Thr Leu Thr 
770 775 780 

Phe Ala Pro Pro Thr He Pro Leu Val Ser Thr Leu Thr Gly Thr Pro 
785 790 795 800 

Val Thr Glu Glu Thr Leu Cys Thr Ala Asp His Trp Val Arg Gin Ala 
805 810 815 

Arg Glu Pro Val Arg Phe Leu Asp Ala Met Arg Thr Leu Arg Ala Asp 
820 825 830 

Gly He Asp Thr Phe Val Glu Leu Gly Pro Asp Gly Val Leu Ser Ala 
835 840 845 

Met Ala Arg Asp Cys Ala Asp Asp Arg Pro Asp Gly Asp Thr Thr Gly 
850 855 860 

Ala Gly Asp Gly Glu Thr Pro Asp Pro Leu Leu Thr Leu Pro Leu Leu 
865 870 875 880 

Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu His Pro Gly Gly Phe 
885 890 895 

Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly Val Pro Leu Arg Leu 
900 905 910 

Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val Ala Ala Glu Leu Pro 
915 920 925 

Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu Asp Ala Pro Ala Ala 
930 935 940 

Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala Gly His Pro Leu Leu 
945 950 955 960 

Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly Thr Val Trp Thr Gly 
965 970 975 

Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala Asp His Ser Val His 
980 985 990 

Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu Asp Leu Ala Leu His 
995 1000 1005 

Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu Ala Pro Leu Val Leu 
1010 1015 1020 

Pro Glu Asp Gly Glu Val Arg Leu Arg Val Val Leu Ala Glu Pro Asp 
1025 1030 1035 1040 

Ala Ser Gly Val Arg Glu Leu Ser Val His Ser Ala Gly Glu Asp Gly 
1045 1050 1055 

Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp Thr Gly Thr Thr Thr 
1060 1065 1070 
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Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala Trp Pro Pro Gly Asp 
1075 1080 1085 

Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu Arg Phe Ala Asp Ala 
5 1090 1095 1100 

Gly lie Gly Tyr Gly Pro Ala Phe Arg Gly Leu Arg Ser Ala Trp Arg 
1105 1110 1115 1120 

10 Asp Gly Asp Ala lie Leu Ala Asp Val Arg Leu Pro Gly Glu Leu Ala 

1125 1130 1135 

Gly Glu Ala Asp Arg Tyr Gly lie His Pro Ala Leu Leu Asp Ala Ala 
1140 1145 1150 

15 Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala His Gly Met Leu Pro 

1155 1160 1165 

Phe Thr Trp Asn Gly Val Thr Leu His Ala Arg Gly Ala His Ala lie 
1170 1175 1180 

Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala Val Ala Val Thr Ala 
1185 1190 1195 1200 

Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala Ala Ser Leu Thr Leu 
1205 1210 1215 

Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala Glu Ala Ala Arg Ala 
1220 1225 1230 

Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro Asp Thr Gly Thr Ala 
30 1235 1240 1245 

Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro Gly Asp Leu Leu Pro 
1250 1255 1260 



20 



25 



35 



Gly Glu Thr Pro His His Pro Asp Leu Ala Ser Ala Leu Ala Asp Thr 
1265 1270 1275 1280 

Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu Arg Gly Tyr Gly Thr 
1285 1290 " * 1295 

40 Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu Ala Leu Val Gin Gin 

1300 1305 1310 

Trp Ala Asp Ala Ala Glu Ala Ala Glu Gly Arg Leu Val Leu Val Thr 
1315 1320 1325 

45 

Arg Arg Ala Val Asp He Gly Asp Gly Val Thr Asp Pro Ala Ala Ala 
1330 1335 1340 

Thr Val Trp Gly Leu Val Arg Ala Ala Gin Ser Glu His Pro Gly Cys 
1345 1350 1355 1360 

so 

Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg Ser Arg Gin Leu Leu 

1365 1370 ~ " 1375 

Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala Leu Arg Asp Gly Thr 
55 1380 1385 1390 
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Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu Pro Ala Gly Ala Arg 
1395 1400 1405 

Leu Pro Ala Leu Asp Gly Thr Val Leu He Thr Gly Gly Thr Gly Ser 
1410 1415 1420 

Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr Arg His Gly Ala Arg 
1425 1430 1435 1440 

Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin Ala Pro Gly Ala Ala 
1445 1450 1455 

Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala His Ala Asp Val Ala 
1460 1465 1470 

Ala Cys Asp Val Ala Asp Arg Ala Ala Leu Arg Ala Leu Leu Asp Arg 
1475 1480 14B5 

Val Pro Ala Gly His Pro Leu Thr Ala Val Leu His Thr Ala Gly Val 
1490 1495 1500 

Leu Asp Asp Gly Val Leu Thr Ala Gin Thr Pro Gin Arg Leu Ala Ala 
1505 " * 1510 1515 1520 

Val Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Leu Thr 
1525 1530 1535 

Gin Gly His Ala Leu Ser Ala Phe He Leu Tyr Ser Ser Ala Ala Gly 
1540 1545 1550 

Val Leu Gly Ser Ala Gly Gin Ser Gly Tyr Ala Ala Ala Asn Ala Tyr 
1555 1560 1565 

Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg Gly Leu Pro Ala Val 
1570 1575 1580 

Ser Leu Gly Trp Gly Pro Trp Asp Gly Gly Gly Met Ala Ser Gly Leu 
1585 1590 1595 1600 

Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser Gly He Ala Pro Leu 
1605 1610 1615 

Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala Ala Leu Ala Ala Gly 
1620 1625 1630 

Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro He Arg Val Asp Ala 
1635 1640 1645 

Val Thr Leu Arg Gly Ala Asp Thr Val Pro Ala Val Leu Arg Asp Leu 
1650 1655 1660 

Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro Pro Gly Thr Pro Glu 
1665 1670 1675 1680 

Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin Leu His Gly Arg Glu 
1685 1690 1695 

Arg Lys Glu Ala Leu Thr Gly Phe Val Arg Ala Gin Val Ala Ala Val 
1700 * 1705 1710 
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Leu Gly His Pro Thr Ser Asp Thr lie Asp Val Arg Arg Ser Phe Lys 
1715 1720 1725 

Glu Ala Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu 
1730 1735 1740 

Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Thr Leu Val Phe Asp His 
1745 . 1750 1755 1760 

Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His Arg Glu Leu Pro Gly 
1765 1770 1775 

Ala Glu Ala Ser Leu Met Ser Ala lie Asp Thr Leu Arg His Arg Leu 
1780 1785 1790 

Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Asp Ala Leu Arg Asp Gin 
1795 1800 1805 

lie Thr Arg Arg Leu Glu Thr Leu Leu Ala Gly He Ala Arg Thr Glu 
1810 1815 1820 

Glu Pro Ala Pro Ala Thr Ala Ala Ala Asp Asp Gly Ser Gly Ala Gly 
1825 1830 1835 1840 

Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp Asp Glu Leu Phe Glu 
1845 1850 1855 

Leu Leu Asp Ser Gly Phe Thr Pro * 

1860 1865 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3730 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu Asp Lys Leu 
15 10 15 

Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu Ser Arg Glu 
20 25 30 

Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro lie Ala Val Val 
35 40 45 

Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro Glu Ala Leu 
50 55 60 

Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro Phe Pro Gly 
65 70 75 80 

Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val 
85 90 95 
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Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala 
100 105 110 

Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala 
115 120 125 

Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He 
130 135 140 

Glu Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser Arg Thr Gly 
145 150 155 160 

Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Thr Gly Gin Thr Ser 
165 170 175 

Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser Gly Ser Val 
180 185 190 

Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val 
195 200 205 

Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
210 215 220 

Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu Ala Gly Gly 
225 230 235 240 

Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe Ser Arg Gin 
245 250 255 

Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala 
260 265 270 

Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu Val Glu Arg 
275 280 285 

Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala Val Val Cys 
290 295 300 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
305 310 315 320 

Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu Ala Asn Ala 
325 330 335 

Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
340 345 350 

Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Gly Thr Tyr 
355 * 360 365 

Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly Ser Leu Lys 
370 375 380 

Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala Gly Val He 
385 390 395 400 



Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg Thr Leu His 
405 410 415 
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Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Trp 
420 425 430 

Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Arg 
435 440 445 

Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Leu He 
450 455 460 

Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala Thr Thr Pro Val 
465 470 475 480 

Arg Ser Glu Val Ser Glu Ser Ala Ala Val Phe Asp Ala Arg Ser Gly 
485 490 495 

Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu 
500 505 510 

Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu 
515 520 525 



Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly TVr Ara 
530 535 540 

Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg 
545 550 555 560 

Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val 
565 570 575 

Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe 
580 585 590 

Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
595 600 605 

Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 
610 615 620 

Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 
625 630 635 4 640 

Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 
fi 45 650 655 

Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
660 665 670 

Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly 
675 680 685 

Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly 
690 695 700 

Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 
705 710 715 720 

Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 
725 730 735 
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Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
740 745 750 

Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
755 760 765 

Gin Ala Arg Leu lie Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
770 775 780 

Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly lie Arg Pro 
785 790 795 800 

Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
805 810 815 

Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
820 825 830 

Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 
835 840 845 

Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala lie 
850 855 860 

Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
865 870 875 880 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
885 890 895 

Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 
900 905 910 

Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 
915 920 925 

Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser 
930 935 940 

Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr Lys Val Ser 
945 950 955 960 

Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arg Leu Asp Gly 
965 970 975 

Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro Ser Val Ala 
980 985 990 

Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala Thr Val Glu Ser 
995 1000 1005 

Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg Gly Leu Leu 
1010 1015 1020 

Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro Leu Arg Gly He Val 
1025 1030 1035 1040 

Ser Leu Leu Ala Leu Ala Gly Asp His Ala Gly Ala Asp Gly Ala Arg 
1045 1050 1055 
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70 



75 



Pro Val Val Pro Ala Gly Leu Ala Ala Ser Leu Ala Leu lie Gin Ala 
1060 1065 1070 

Ala Gly Asp Ala Gly Thr Glu Ala Gly Leu Trp Ala Val Thr Arg Gly 
1075 1080 1085 

Ala Val Ala Ala Val Pro Gly Asp Val Pro Ala Pro Ser Gin Ala Leu 
1090 1095 1100 

Leu Trp Gly Phe Gly Arg Val Ala Gly He Glu Leu Pro His Cys Trp 
1105 ~ 1110 1115 1120 

Gly Gly Leu Leu Asp Leu Pro Thr Gly Pro Gly Asp Ser Gly Phe Arg 
1125 1130 1135 

Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp Gin Val Ala 
1140 1145 1150 

Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg Ala Ser Ala 
20 1155 1160 1165 

Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val Leu Val Val 
1170 1175 1180 

Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp Leu Leu Gly 
25 1185 1190 1195 1 2 0 0 

Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly Pro Leu Pro 
1205 1210 1215 

30 Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala Pro Cys Asp 

1220 1225 1230 

Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu Gin Ala Pro 
1235 '1240 1245 

35 Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr Pro Leu Ala 

1250 1255 1260 

Glu Met Thr Ala Glu Ala Leu Ala He Ala Leu Ser Ala Lys Thr Gly 
1265 1270 1275 1280 

Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp Pro Leu Leu 
1285 1290 1295 

Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly 
1300 1305 1310 

Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly Thr Ala Tyr 
1315 1320 1325 

Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu Pro Val Thr 
50 1 3 3 0 13 3 5 13 4 0 

Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala Asp Ser Leu 
1345 1350 1355 1360 

55 Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp Pro Ala Ala 

1365 1370 1375 



40 



45 
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Ser Leu Asp Ala Leu Ala Arg Ala Val Gly Arg Arg Ala Gly Cys Val 
1380 1385 1390 

Thr Val Ala Asp He Asp Trp Glu Arg Phe Ala Ser Ala Tyr Thr Ala 
1395 1400 1405 

Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val Arg Arg He 
1410 1415 1420 

Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg Ser Gly Ala 

1425 1430 1435 1440 

Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Arg Pro Glu Glu 
1445 1450 1455 

Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His Ala Ala Ala 
1460 1465 1470 

Val Leu Gly Leu Gly Ser Pro Gly Ala Val Glu Ala Arg Arg Ser Phe 
1475 1480 1485 

Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu Arg Asn Arg 
1490 " 1495 1500 

Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu Val Phe Asp 

1505 1510 1515 1520 

His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp Leu Ala Leu 
1525 1530 1535 

Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu Glu Pro Ala 
1540 1545 1550 

Pro Asn Gly Glu Pro He Ala He Val Ser Met Ala Cys Arg Met Pro 
1555 1560 1565 

Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Leu Arg Asp Gly 
1570 1575 1580 

Lys Asp Ala He Gly Pro Phe Pro Ala Asn Arg Gly Trp Asp Leu Glu 

1585 1590 1595 1600 

Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly Arg Thr Tyr Val Arg 
1605 1610 1615 

Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro Ser Phe Phe 
1620 1625 1630 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
1635 1640 1645 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro 
1650 1655 1660 

Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Thr Asn Gly 

1665 1670 1675 1680 

Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser Phe Asp Gly 
1685 1690 1695 
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Tyr Leu Gly Thr Cly Asn Ser Ala Ser Val Met Ser Cly Arg Leu Ser 
1700 1705 1710 

Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1715 1720 1725 

Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Met Arg Arg 
1730 1735 1740 

Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr 
1745 1750 1755 1760 

Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val He Ser Ala Asn 
1765 1770 1775 

Gly Arg Ser Arg Ala Phe Ala Ala Gly Ala Asp Gly Val Ala Leu Gly 
1780 1785 1790 

Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Glu Arg 
1795 1800 ^ 1805 

Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
1810 1815 1820 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin 
1825 1830 1835 1840 

Arg Val He Arg Gin Ala Leu Ala Asp Ala Gly Leu Arg Pro Glu Asp 
1845 1850 " 1855 

He Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu Gly Asp Pro 
1860 1865 ~ 1870 

He Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr Arg Thr Ala 
1875 1880 1885 

Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr 
1890 1895 1900 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu 
1905 1910 1915 1920 

Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr Pro 
1925 1930 * 1935 

Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr Glu Pro Val 
1940 1945 1950 

Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala Val Ser Ser 
1955 i960 1965 

Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu Gin Ala Pro 
1970 1975 1980 

Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp Gly Thr Gly 
1985 1990 1995 2000 

Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly His Thr Glu 
2005 2010 2015 
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Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp Leu Leu Glu 
2020 2025 2030 

His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu He Thr Gly 
2035 2040 2045 

Arg Ala His Phe Gly His Arg Ala Ala Val He Gly Glu Ser Arg Glu 
2050 2055 2060 

Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg Glu His His 
2065 2070 2075 2080 

Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Arg Arg Val Val 
2085 2090 2095 

Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met Ala Arg Asp 
2100 2105 2110 

Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys Ala Cys Asp 
2115 2120 2125 

Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp Val Leu Gin 
2130 2135 2140 

Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val Val Gin Pro 
2145 2150 2155 2160 

Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp Arg Asp Leu 
2165 2170 2175 

Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala 
2180 2185 2190 

Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg He 
2195 2200 2205 

Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly Lys Gly Gly 
2210 2215 2220 

Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu Arg He Glu 
2225 2230 2235 2240 

Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser Pro Gly Thr 
2245 2250 2255 

Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu Leu Ala Glu 
2260 2265 2270 

Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly Val Asp Thr 
2275 2280 2285 

Ala Gly His Ser Ala Gin Val Asp Gly Leu Lys Glu His Leu Phe Glu 
2290 2295 2300 

Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro Phe Tyr Ser 
2305 2310 2315 2320 

Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp Ala Gly Tyr 
2325 2330 2335 
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Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe Glu Lys Ala Val Arg 
2340 2345 2350 

Ala Leu lie Ala Asp Gly Tyr Asp Leu Phe Leu Glu Cys Asn Pro His 
2355 2360 2365 

Pro Met Leu Ala Met Ser Leu Asp Glu Thr Leu Thr Asp Ser Gly Gly 
2370 2375 2380 

His Gly Thr Val Met His Thr Leu Arg Arg Gin Lys Gly Ser Ala Lys 
2385 2390 2395 2400 

Asp Phe Gly Met Ala Leu Cys Leu Ala Tyr Val Asn Gly Leu Glu lie 
2405 2410 ^ 2415 

Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val Asn Pro Pro 
2420 2425 2430 

Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp Tyr His Pro Thr Ser Gly 
2435 2440 2445 

Arg Arg Gly Asp He Thr Ala Ala Gly Val Ala Glu Ala Glu His Pro 
2450 2455 2460 

Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly Thr Val Tyr 
2465 2470 2475 2480 

Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Ala Asp His Ala 
2485 2490 2495 

Leu Leu Gly Thr Val Leu Leu Pro Gly Thr Ala He Leu Asp Leu Val 
2500 2505 2510 

Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val Gly Glu Leu Ala 
2515 2520 2525 

Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val Glu Leu Arg 
2530 2535 2540 

Leu Leu Val Gly Gly Pro Asp Glu Glu Lys Arg Arg Thr Val Thr Val 
2545 2550 2555 " 2560 

His Ala Arg Pro Ala Ala Ala Gly Ala Glu Ala Pro Trp Thr Arg His 
2565 2570 2575 

Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro Thr Pro Ala 
2580 2585 * 2590 

Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro Ala Ala Phe 
2595 2600 2605 

Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro Ala Phe Gin 
2610 2615 2620 

Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val Ala Glu Val 
2625 2630 2635 2640 

Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His Arg Leu His 
2645 2650 2655 
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Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met lie Leu Cly Thr Phe 
2660 2665 2670 

Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val Arg Gly Val 
2675 2680 2685 

Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu He Ser Pro 
2690 2695 2700 

Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu Ala Thr Gly 
2705 2710 2715 2720 

Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro Val Ser Gly 
2725 2730 2735 

Glu Gin Leu Ala Ala Gly Ala Pro Gly Arg Asn Gly Gly Glu Leu Tyr 
2740 2745 2750 

Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val Pro Ala Pro 
2755 2760 2765 

Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala Asp Val Leu 
2770 2775 2780 

Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu Thr Gly Leu 
2785 2790 2795 2800 

Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly He Leu Pro Asp He Val 
2805 2810 2815 

Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin Ala Val Arg 
2820 2825 2830 

Glu Val Leu Ser Gin Ala Leu Asp Ala Ala Gin Ala Trp Leu Ala Ala 
2835 2840 2845 

Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr Gly Gly Ala 
2850 2855 2860 

Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala Ala Ala Ala 
2865 2870 2875 2880 

Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro Asp Arg Met 
2885 2890 2895 

Val Leu Leu Asp Leu Asp Gly Glu Arg Pro Thr Ala Arg Thr Leu Ala 
2900 " 2905 2910 

Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg Gly Ser Thr 
2915 2920 2925 

Val Ala Ala Pro Arg Leu Ala Pro Ala Gly Pro Gly Pro Glu Asp Leu 
2930 2935 2940 

Val Pro Pro Ala Gly Thr Thr Ala Trp Arg Leu Thr Pro Gly Gly Gly 
2945 2950 2955 2960 

Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala Glu Glu Pro 
2965 2970 2975 
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Leu Ala Pro Gly Gin Val Arg He Ala Val Arg Ala Ala Gly Val Asn 
2980 2985 2990 

Phe Arg Asp Ala Leu He Ala Leu Gly Met Tyr Pro Gly Lys Gly Thr 
2995 3000 3005 

Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala Pro Asp Val 
3010 3015 3020 

Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp Asn Gly Gly 
3025 3030 3035 3040 

Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala Pro He Pro 
3045 3050 3055 

His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala Val Leu Leu 
3060 3065 3070 

Thr Ser Tyr Tyr Ala Leu Thr Arg Leu Ala Arg Ala Arg Thr Gly Gin 
3075 3080 3085 

Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met Ala Thr Leu 
3090 3095 3100 

Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr Ala Ser Thr 
3105 3110 3115 3120 

Gly Lys Trp Asp Ala Leu Gin Lys His Gly He Pro Asp Asp Arg He 
3125 3130 3135 

Ala Asp Ser Arg Thr Leu Asp Phe Ala Glu Arg Phe Leu Ser Arg Thr 
3140 3145 3150 

Gly Gly Arg Gly Val Asp He Val Leu Asn Ser Leu Ala Gly Glu Phe 
3155 3160 3165 

Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His Phe Leu Glu 
3170 3175 3180 

Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg lie Ala Ala Ala His 
3185 3190 3195 3200 

Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala Gly Pro Asp 
3205 3210 3215 

Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe Ala Ala Gly 
3220 3225 3230 

Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly He Arg Asp Ala Arg 
3235 3240 3245 

Thr Ala Leu Arg Thr Leu Ser Gin Ala Arg His Thr Gly Lys Leu Val 
3250 3255 3260 

Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val Leu Leu Thr 
3265 3270 3275 3280 

Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Arg His Leu Val Asn 
3285 3290 3295 
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10 



Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr Gly Ala Ala 
3300 3305 3310 

Ala Glu Gly Val Ala Glu Leu He Gly Glu Leu Gly Glu Leu Gly Ala 
3315 3320 3325 

Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin Arg Leu Thr 
3330 3335 3340 

Glu Leu Leu Ala Gly He Pro Val Glu His Pro Leu Gly Ala Val Val 
3345 3350 3355 3360 

His Ala Ala Gly Thr Leu Asp Asp Gly Thr He Pro Ser Leu Thr Gly 
3365 3370 3375 

Glu Asn He Asp Asn Val Leu Arg Pro Lys Ala Asp Ala Val Leu Asn 
3380 3385 3390 

Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Ala Phe Val Leu Tyr 
3395 3400 3405 

Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro Gly Gin Gly Ala Tyr Ala 
3410 3415 3420 

Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg Lys Gly Leu 
25 ' 3425 3430 3435 3440 

Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Leu Trp Gly Ser Asn Ser 
3445 3450 3455 



15 



20 



30 



35 



40 



Arg Met Ala Gly His Leu Asp Gin Ser Gly Met Gin Arg Arg Leu Asn 
3460 3465 3470 

Arg Ser Gly He Met Ala Leu Thr Asp Ala Glu Gly Leu Ala Leu Phe 
3475 3480 3485 

Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro Met Arg Leu 
3490 3495 3500 

Asn Arg Thr Ala Leu Arg Ala Ser Gly Arg He Thr Pro Phe Leu Ser 
3505 3510 3515 3520 

Gly Leu Ala Gly Gly Gly Pro Ala Ala Gly Glu Arg Arg Pro Glu Val 
3525 3530 3535 

Ala Ala Val Ser Gly Thr Leu Ala Glu Arg Leu Thr Gly Leu Thr Ala 
3540 3545 3550 

Gin Glu Gly His Ala Leu Val Leu Ala Glu He Arg Ala His Ala Ala 
3555 3560 3565 

Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu Asp Arg Ala 
50 3570 3575 3580 

Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Met Arg Asn 
3585 3590 3595 3600 



45 



55 



Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe 
3605 3610 3615 
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Asp His Pro Thr Pro Gly Glu Leu Ala Gly His Leu Ser Ala Glu Leu 
3620 3625 3630 

Ser Ala Asp Asp Ala Pro Gly Ser Ala Ser Pro Leu Thr Glu Leu Asp 
3635 3640 3645 

Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro Gly Thr Thr Lys Asp 
3650 3655 3660 

Thr Pro Gly Gly Ala Gly Ala Leu Met He Asp Glu Ala Glu Arg Gin 
3665 3670 3675 3680 

Glu He Ala Gly Arg Leu Ala Ala Leu Ala Gly Leu Trp Asn Arg Leu 
3685 3690 3695 

His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp Thr Val Ala Asp Ala 
3700 3705 3710 

Leu Glu Ala Ala Asp Asp His Glu He Phe Ala Phe Leu Asp Glu Arg 
3715 3720 3725 

Phe * 

3730 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

35 Met Ala Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Ala 

15 10 15 

Thr Thr Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ala 
20 25 30 

Arg Ala His Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu Leu Asp Thr Gly 
50 55 60 

Thr Asp Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His 
65 70 75 80 

Gly Leu Tyr Asp Pro Asp Pro Ala Ala Ala Gly Arg Thr Tyr Cys Arg 
50 85 90 95 

Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Asp Phe Phe 
100 105 110 

55 Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 

115 120 125 
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Leu Leu Glu Thr Ser Trp Glu Ala lie Glu Ala Ala Gly He Asp Pro 
130 135 140 

Arg Gly Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala Trp Asp 
145 150 155 160 

Ser Gly Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu Leu Glu Ala 
165 170 175 

Asp Leu Leu Thr Gly Gly Val Val Ser Phe Thr Ser Gly Arg He Ala 
180 185 190 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Arg 
210 215 220 

Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr 
225 230 235 240 

Pro Ala Val Phe Val Gin Phe Ala Arg Gin Arg Gly Leu Ala Pro Asp 
245 250 255 

Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Phe Gly Pro Ala 
260 265 270 

Glu Gly Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 

290 295 300 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu 
305 310 315 320 

Arg Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 
325 330 335 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
355 360 365 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
370 375 380 

Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val Leu Ala Met 
385 390 395 400 

Arg Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 
405 410 415 

His Val Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg 
420 425 430 



Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 
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Gly Val Ser Gly Thr Asn Ala His Leu He Leu Glu Ala Pro Asp Thr 
450 455 460 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 
465 470 475 480 

Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
485 490 495 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
500 505 510 

Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
515 520 525 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
530 535 540 

Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp 
545 550 555 560 

Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
565 570 575 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
580 585 590 

Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
595 600 605 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp 
610 615 620 

Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp 
625 630 635 640 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
645 650 655 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin 
660 665 670 

Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp 
675 680 685 

Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu 
690 695 700 

Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val 
705 710 715 ^ 720 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
725 730 735 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
740 745 750 



Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro 
755 760 765 
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Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Cly Clu 
770 775 780 

Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro 
785 790 795 800 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
805 810 815 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
820 825 830 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val 
835 840 845 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
850 855 860 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
865 870 875 880 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
885 890 895 

Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr 
900 905 910 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
915 920 925 

Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 
930 935 940 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Glu Ser Thr Pro Ala Ser 
945 950 955 960 

Ser Pro Ser Gly His Trp Leu Leu Val Thr Pro Pro Thr Pro Glu Gly 
965 970 975 

Arg Thr Leu Gly Asp Arg Ala Ala Gly Ala Leu Ala Arg Gin Gly Ala 
980 985 990 

Thr Val Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly 
995 1000 1005 

Leu Ala Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu 
1010 1015 1020 

Gly Ala Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala 
1025 1030 1035 1040 

Val Met Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys 
1045 1050 1055 

Glu Ala Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro 
1060 1065 1070 

Ser Glu Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg 
1075 1080 1085 
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Gly lie Ala Leu Glu His Pro Ser Leu Trp Gly Gly Leu lie Asp Leu 
1090 1095 1100 

Pro Ala Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu 
1105 1110 1115 1120 

Val Pro His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr 
1125 1130 1135 

Gly Arg Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys 
1140 1145 1150 

Thr Pro Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
1155 1160 1165 

Gly His Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val 
1170 1175 1180 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala 
1185 1190 1195 1200 

Gly Glu Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp 
1205 1210 1215 

Met Ala Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro 
1220 1225 1230 

Thr Ala Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr 
1235 1240 1245 

Ala Leu Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu 
1250 1255 1260 

Gly Ala Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly He Gly Leu 
1265 1270 1275 1280 

Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly 
1285 1290 1295 

Gly Gin Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1300 1305 1310 

Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly 
1315 1320 1325 

Leu Trp Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu 
1330 1335 1340 

Ser Arg Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala 
1345 1350 1355 1360 

Leu Asp Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp 
1365 1370 1375 

Val Asp Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser 
1380 1385 1390 

Ala Leu He Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin 
1395 1400 1405 
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Asp Ala Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly 
1410 1415 1420 

Pro Leu Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val 
1425 1430 1435 1440 

Ala Ala Val Leu Gly His Pro Gly Thr Ala Asp He Gly Pro Asp Arg 
1445 1450 1455 

Ala Phe Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala 
1460 1465 1470 

Gly Arg Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val 
1475 1480 1485 

Phe Asp His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu 
1490 1495 1500 

Leu Thr Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala 
1505 1510 1515 1520 

Arg Ala Ala Leu Ala Arg Val Pro Leu Glu Arg Leu Arg Glu Ala Gly 
1525 1530 1535 

Leu Leu Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr 
1540 1545 1550 

Thr Pro Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu 
1555 1560 1565 

Pro Asp Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro 
1570 1575 1580 

Asp Ala Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala 
1585 1590 1595 1600 

Leu Val Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 1610 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Met Ser Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu 
15 10 15 

Lys Glu Ala Glu Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala 
20 25 30 

Arg Asp Ala Ala Arg Glu Pro He Ala Val Val Gly Met Ala Cys Arg 
35 40 45 
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Tyr Pro Gly Gly Val Thr Gly Pro Glu Glu Leu Tip Glu Leu Val Ala 
50 .55 60 

Gly Gly Arg Asp Ala lie Gly Pro Phe Pro Val Asp Arg Gly Trp Asp 
65 70 75 " 80 

Val Ala Ser Val Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr 
85 90 95 

Cys Arg Glu Gly Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala 
100 105 110 

Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin 
115 120 125 

Arg Leu Leu Leu Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He 
130 135 140 

Asp Pro Ser Ser Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala 
145 150 155 160 

Ala His Gly Ser Tyr Ala Ser Asp Pro Arg Leu Val Pro Glu Gly Ser 
165 170 175 

Glu Gly Tyr Leu Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg 
180 185 190 

He Ser Tyr Ala Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr 
195 200 205 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu 
210 215 220 

Arg His Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met 
225 230 235 240 

Ala Asp Pro Ala Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala 
245 250 " * 255 

Ala Asp Gly Arg Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly 
260 265 270 

Trp Ala Glu Gly Val Gly Val Leu Val Leu Glu Arg Leu Ser Asp Ala 
275 280 285 

Arg Arg Ala Gly His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val 
290 295 300 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
305 310 315 " 320 

Gin Gin Arg Val He Ala Glu Ala Leu Ala Asp Ala Gly Leu Ser Pro 
325 330 335 

Glu Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
340 345 ° 350 



Asp Pro He Glu Ala Gly Ala Leu Leu Ala Ala Ser Gly Arg Asn Arg 
355 360 365 
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Ser Gly Asp His Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly 
370 375 380 

His Ala Gin Ala Ala Ala Gly Val Gly Gly Val He Lys Met Leu Gin 
385 390 395 400 

Ala Leu Arg His Gly Leu Leu Pro Arg Thr Leu His Ala Asp Glu Pro 
405 410 415 

Thr Pro His Ala Asp Trp Ser Ser Gly Arg Val Arg Leu Leu Thr Ser 
420 425 430 

Glu Val Pro Trp Gin Arg Thr Gly Arg Pro Arg Arg Thr Gly Val Ser 
435 440 445 

Ala Phe Gly Val Gly Gly Thr Asn Ala His Val Val Leu Glu Glu Ala 
450 455 460 

Pro Ala Pro Pro Ala Pro Glu Pro Ala Gly Glu Ala Pro Gly Gly Ser 
465 470 475 480 

Arg Ala Ala Glu Gly Ala Glu Gly Pro Leu Ala Trp Val Val Ser Gly 
485 490 495 

Arg Asp Glu Pro Ala Leu Arg Ser Gin Ala Arg Arg Leu Arg Asp His 
500 505 510 

Leu Ser Arg Thr Pro Gly Ala Arg Pro Arg Asp He Ala Phe Ser Leu 
515 520 525 

Ala Ala Thr Arg Ala Ala Phe Asp His Arg Ala Val Leu He Gly Ser 
530 535 540 

Asp Gly Ala Glu Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg 
545 550 555 560 

Asp Gly Pro Ala Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met 
565 570 575 

Ala Phe Leu Phe Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His 
580 585 590 

Asp Leu His Ala Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val 
595 600 605 

Thr Asp Arg Leu Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu 
610 615 620 

Asp Ala Arg Pro Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu 
625 630 635 640 

Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu 
645 650 655 

Leu Glu His Trp Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val 
660 665 670 



Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp 
675 680 685 
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Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro 
690 695 700 

Pro Gly Gly Ala Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg 
705 710 715 720 

Ala Leu Leu Ala Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn 
725 730 735 

Gly Pro Arg Ser Val Val He Ser Gly Ala Glu Glu Ala Val Ala Glu 
740 745 750 

Ala Ala Ala Gin Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg 
755 760 765 

Val Ala His Ala Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly 
770 775 780 

Phe Arg Glu Val Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr 
785 790 795 800 

Val Val Ser Thr Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr 
805 810 815 

Gly Pro Asp Tyr Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala 
820 825 830 

Asp Ala Val Arg Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu 
835 840 845 

Thr Gly Pro Asp Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu 
850 855 860 

Asp Asp Thr Val Ala Leu Leu Pro Ala He His Lys Pro Gly Thr Ala 
865 870 875 880 

Pro His Gly Pro Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala 
885 890 895 

Tyr Gly Arg Gly Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly 
900 905 910 

Pro Glu Gly Pro Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg 
915 920 925 

His Arg Arg Tyr Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp 
930 935 940 

Trp Met Tyr Arg He Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly 
945 950 955 960 

Ala Arg Thr Ala Gly Arg Trp Leu Val He His Pro Asp Ser Pro Arg 
965 970 975 

Cys Arg Glu Leu Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly 
980 985 990 



Ala Ser Pro Val Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala 
995 1000 1005 
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Ser Phe Ala Ala Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly 
1010 1015 1020 

Asp Thr Ala Ala Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu 
1025 1030 1035 1040 

Asp Arg Pro His Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala 
1045 1050 1055 

Thr Leu Ser Leu Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg 
1060 1065 1070 

Val Trp Cys Val Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg 
1075 1080 1085 

Pro Val Gly Ala Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala 
1090 1095 1100 

Leu Glu Arg Pro Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser 
1105 1110 1115 1120 

Pro Gly Ala Ala His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro 
1125 1130 1135 

Glu Asp Gin He Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu 
25 * 1140 1145 1150 

Thr Arg Leu Pro Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr 
1155 1160 1165 



15 



20 



30 



35 



40 



Arg Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
1170 1175 1180 

Gly His Leu Ala Arg Trp Leu Ala Ala Ala Gly. Ala Glu His Leu Ala 
1185 1190 1195 1200 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu 
1205 1210 1215 

Ala Glu Leu Leu Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp 
1220 1225 1230 

Thr Ala Asp Arg Asp Gly Leu Ala Arg Val Leu Arg Ala He Pro Glu 
1235 1240 1245 

Asp Thr Pro Leu Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val 
1250 1255 1260 

Thr Pro Leu Ser Arg Thr Ser Pro Glu His Phe Ala Asp Val Tyr Ala 
1265 1270 1275 1280 

Gly Lys Ala Ala Gly Ala Ala His Leu Asp Glu Leu Thr Arg Glu Leu 
so ^ 1285 1290 1295 

Gly Ala Gly Leu Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val 
1300 1305 1310 

55 Trp Gly Ser Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu 

1315 1320 1325 



45 
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Asp Ala Leu Ala Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser 
1330 1335 1340 

He Ala Trp Gly Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala 
1345 1350 1355 1360 

Gly Ala Glu Tyr Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val 
1365 1370 1375 

Ser Ala Leu Arg Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Cys 
1380 1385 1390 

Pro Thr Val Thr His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr 
1395 1400 1405 

Ala Phe Arg Pro Ser Pro Leu He Ala Gly Leu Gly Thr Pro Gly Gly 
1410 1415 1420 

Gly Arg Ala Ala Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala 
20 1425 1430 1435 1440 

Asp Leu Thr Ala Leu Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu 
1445 1450 1455 



15 



25 



30 



35 



Leu Val Arg Ala Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala 
1460 1465 1470 

Glu Val Ala Glu Gly Glu Arg Phe Pro Ala Met Gly Phe Asp Ser Leu 
1475 1480 1485 

Ala Thr Val Arg Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp 
1490 1495 1500 

Leu Pro Pro Asp Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala 
1505 1510 1515 1520 

Ala His Leu Ala Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly 
1525 1530 1535 

Gly Pro Gly Thr Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro 
1540 1545 1550 

Ala Leu Tyr Arg Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala 
1555 1560 1565 

Glu Leu Leu Ala Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala 
45 1570 1575 1580 

Asp Arg Gin Pro Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asp 
1585 1590 1595 * 1600 



40 



50 



55 



Thr Gly Leu Pro Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser 
1605 1610 1615 

Gly Pro Val Glu Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro 
1620 1625 1630 

Ala Ala Ala Pro Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly 
1635 1640 1645 
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Glu Arg Val Pro Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu 
1650 1655 1660 

Ala Leu Leu Arg Tyr Ala Ala Gly Arg Pro Phe Val Leu Leu Gly His 
1665 1670 1675 1680 

Ser Ala Gly Ala Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala 
1685 1690 1695 

Asn Gly Gly Gly Pro Ala Gly Leu Val Leu Met Asp He Tyr Thr Pro 
1700 1705 1710 

Ala Asp Pro Gly Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp 
1715 1720 1725 

Val Trp Arg Arg Ser Asp He Pro Pro Asp Asp His Arg Leu Thr Ala 
1730 1735 1740 

Met Gly Ala Tyr His Arg Leu Leu Leu Asp Trp Ser Pro Thr Pro Val 
1745 1750 1755 1760 

Arg Ala Pro Val Leu His Leu Arg Ala Ala Glu Pro Met Gly Asp Trp 
1765 1770 1775 

Pro Pro Gly Asp Thr Gly Trp Gin Ser His Trp Asp Gly Ala His Thr 
1780 1785 1790 

Thr Ala Gly He Pro Gly Asn His Phe Thr Met Met Thr Glu His Ala 
1795 1800 1805 

Ser Ala Ala Ala Arg Leu Val His Gly Trp Leu Ala Glu Arg Thr Pro 
1810 1815 1820 

Ser Gly Gin Gly Gly Ser Pro Ser Arg Ala Ala Gly Arg Glu Glu Arg 
1825 1830 1835 1840 



Pro 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 350.. 14002 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 14046.. 20036 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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CTCGCTGAAC 
GCAGACCGCT 

5 

CAGTTCGTTC 
GCCCACCCCG 

10 CGCGCGGTCG 
GGTGGGCGCG 
GCACCGTGCG 

15 TGCTGCCGGG 
GGTGTTGTTC 
GGGTGGGGTG 

20 

TCGGTCGTTG 
TGCGGGTGCC 
GGGTCGGACG 

25 

GTTGGAGGCT 
TGCTGCGTAT 
30 GGGTGGGTTG 
GTCGGTGGTG 
TGGGCCGCGG 
GCTGATGGGG 
GTTGATGGAG 
GGTGCGGCCG 

40 

GTTGGGGGAT 
GGTGGGGGTG 
GCTGACGGGG 

45 

GATGCGGCGG 
CCGGGACGCC 
50 CCTCCCCACC 
CACGGCCGAC 
GGACCCGGAG 

55 

GTCACCGGAG 



TTCGCCACCC CCTCTCCGGC 
GCCGCCGAGC TGCCGCTCGC 
GGCATCGGTG GCACCAACTG 
GCCGTCTCCG TCGCCGCCTC 
GAGGGGGCGT TGCGGGCGCA 
GATCCGCGGG ATGTGGCTTA 
GTGGTGCCGT GTGGTGGGCG 
AGGGTGTCTG GGGGTGTGCG 
ACGGGTCAGG GTGCGCAGTG 
TTTGCGGAGG TGCTGGATGA 
CGGGATGTGA TGTTCGGCGA 
GGTGCGGGTG CTGGGGTCGG 
GAGTTTGCTC AGCCTGCGTT 
CGGGGTGTGG AGGTGTCGGT 
GTGGCGGGGG TGTTGTCGTT 
ATGGGTGGGT TGCCGGTGGG 
CGGGGGGTTG TTGAGGGGTT 
TCGGTGGTGT TGTCGGGTGA 
GATGGGGTGG AGTGCCGGCG 
CCGGTGTTGG GGGAGTTCCG 
GGTGTGGTGG TCGTGTCGGG 
CCGGGGTATT GGGTGCGTCA 
GTGCGTGGTC TCGGTGTGGG 
ATGGCGGGTG AGTGCCTGGG 
GGCCGTGCGG AGCGGGAGGT 
GGCCTGGACG CCACGGCACT 
TACCCCTTCC AACGCCGTAC 
GCCGGGGCGG GTGTGACCGC 
AGCACCGAGG GGACGTCCCA 
CCGATGTCCC CCGAGGATGC 



CATCCCGCTG GACCAGCTCC 
CCCGGGCGGC GCACCCCTGC 
CCATGTGGTC CTGGAACACC 
GCTTCCGGAC GTCCCGCCGC 
GGCGGTGCGG TTGGGTGAGT 
TTCGCTGGCT TCGACGCGGA 
TGGGGAGCTC GTCGCTGCTC 
GTCCGGGCGG GCTGTGCCGG 
GGTTGGTATG GGGCGTGGGT 
GGTGTTGTCG ATGGTGGGGG 
CGTCGACGTG GACGCGGGTG 
TTCTGGTTCC GGTTCTGTGG 
GTTTGCGTTG GAGGTGGCGT 
GGTGTTGGGT CATTCGGTGG 
GGGTGATGCG GTGCGGTTGG 
TGGGGGGATG TGGTCGGTGG 
GGGGGAGTGG GTGTCGGTTG 
TGTGGGTGTG CTGGAGTCGG 
GTTGGATGTG TCGCATGGGT 
GGGGGTTGTG GAGTCGTTGG 
TGTCTCGGGT GGGGTGGTGG 
TGCGCGGGAG GCGGTGCGTT 
GACGTTGGTG GAGGTGGGTC 
GGCCGGTGAT GATGTGGTGG 
GTTCGAGGCG GCGCTGGCGA 
CCACACCGGG AGCACCGGCC 
CCACTGGTCG CCCGCGCTGA 
CACCGATGCC GTGGGGCACA 
CAGGGACACG GACGACGAGG 
CGTCCGCCTG GTCCGCGAGA 



GGCTGAAGGT 1500 

TGGCGGGTGT 1560 

TGCCCTCCCG 1620 

ACGTGGAGCG 1740 

CTCTTTTCGA 1800 

TTGGTGGGTT 1860 

GTGGGGTGGG 1920 

TGTATGCGGG 1980 

AGGTGGATGG 2040 

CCGGGGCTGA 2100 

GTGGGTTGTT 2160 

TGTTCCGGGC 2220 

GGGAGGTGGC 2280 

TGGTGGCGCG 2340 

GGGCGTCGGA 2400 

CGGCGGTCAA 2460 

TGGTTGCCTC 2520 

TTCATTCGGT 2580 

AGTTCGGTCG 2640 

GTTCGGGGGA 2700 

TCGCGGATGG 2760 

CGCATGGGGT 2820 

TCGTGCCGGC 2880 

CGGTGTTCAC 2940 

GGCGCATCGA 3000 

GCCGGCCGGT 3060 

GCGTCTCCCC 3120 

CGGACTCGGC 3180 

GCACCGCGGC 3240 
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15 



20 
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30 



35 



40 



45 



50 



55 



CGTCCTGGGC CACGACGATC CCGGCGAGGT CGCGCTCGAC CGCACCTTCA CCTCCCAGGG 3300 

CATGGACTCG GTGACCGCGG TCGAGCTGTG CGACCTGCTG AAGGGCGCCT CGGGGCTCCC 3360 

CCTCGCCGCC ACGCTCGTCT ACGACCTCCC CACCCCGCGT GCCGTCGCCG AGCACATCGT 3420 

GGAAGCCGCG GGCGGGCCGA AGGACTCGGT TGCCGGTGGG CCCGGAGTGC TCTCGTCGGC 3480 

CGCGGTAGGG GTGTCGGACG CCCGGGGCGG CAGCCGGGAC GACGACGACC CGATCGCCAT 3540 

CGTGGGTGTC GGCTGCCGGC TCCCCGGCGG CGTCGACTCG CGCGCCGCTC TCTGGGAGCT 3600 

GCTGGAGTCC GGCGCCGACG CCATCTCGTC CTTCCCCACC GACCGCGGCT GGGACCTCGA 3660 

CGGGCTGTAC GACCCCGAGC CCGGGACGCC CGGCAAGACC TATGTGCGGG AGGGCGGGTT 3720 

CCTGCACTCG GCGGCCGAGT TCGACGCGGA GTTCTTCGGG ATATCGCCGC GCGAGGCCAC 3780 

GGCCATGGAC CCGCAGCAGC GCTTGCTGCT GGAAGCGTCG TGGGAGGCCC TCGAGGACGC 3840 

CGGAGTGCTC CCCGAGTCAC TGCGCGGCGG CGACGCCGGA GTGTTCGTCG GCGCCACCGC 3900 

ACCGGAGTAC GGGCCGAGGC TTCACGAGGG AGCGGACGGA TACGAGGGGT ACCTGCTCAC 3960* 

CGGCACCACC GCGAGCGTGG CCTCCGGCCG GATCGCCTAC ACCCTCGGCA CCGGCGGACC 4020 

GGCGCTCACC GTCGACACCG CGTGCTCCTC GTCCCTGGTG GCGCTGCACC TCGCCGTCCA 4080 

GGCGCTGCGC CGGGGCGAGT GCGGGCTGGC TCTGGCGGGC GGCGCCACGG TGATGTCGGG 4140 

GCCCGGCATG TTCGTGGAGT TCTCGCGGCA GCGCGGGCTC GCCCCCGACG GCCGCTGCAT 4200 

GCCGTTCTCC GCCGATGCCG ACGGTACGGC CTGGTCCGAG GGTGTCGCCG TACTGGCACT 4260 

GGAGCGGCTC TCCGACGCCC GGCGTGCGGG ACACCGGGTG CTGGGCGTGG TGCGGGGCAG 4320 

TGCGGTCAAC CAGGACGGTC CCAGCAACGG CCTGACCGCT CCCAACCGCT CCGCGCAGGA 4380 

GGGCGTCATC CGAGCTCCCC TGGCCGACGC CGGCCTCGCG CCGGGTGACG TCGACGCGGT 4440 

GGAGGCGCAC GGTACGGGGA CGGCGCTGGG CGATCCGATC GAGGCGAGCG CGCTGCTGGC 4500 

CACGTACGGG CGTGAGCGGG TGGGCGACCC CTTGTCGCTC GGGTCGCTCA AGTCCAACGT 4560 

CGGTCACACC CAGGCCGCCG CGGGGGCCGC GGGTGTOGTC AAGATGCTGC TTGCCCTGGA 4620 

GCACGGCACG CTGCCGCGGA CACTTCACGC GGACCGGCCC AGCACGCACG TCGACTGGTC 4680. 

GTCGGGCACC GTCGCCCTGC TGGCAGAGGC GCGCCGGTGG CCCCGGCGGT CGGACCGCCC 4740 

GCGCCGGGCG GCTOTCTCGT CGTTCGGGAT CAGTGGGACG AACGCGCATC TGATCATCGA 4800 

GGAGGCGCCG GAGTGGGTCG AGGACATCGA CGGCGTCGCT GCTCCTGACC GCGGTACCGC 4860 

GGACGCGGCT GCTCCGTCGC CGCTGTTGTT GTCCGCGCGG TCGGAGGGGG CGTTCCGGGC 4920 

GCAGGCGGTG CGGTTGGGTG AGTACGTGGA GCGGGTGGGT GCGGATCCGC GGGATCTGGC 4980 

TTATTCGCTG GCTTCGACGC GGACTCTTTT CGAGCACCGT GCGGTGGTCC CGTCTGGTGG 5040 
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GCGTCGGGAG CTCGTCGCTG CTCTTGGTGG GTTTGCTGCC 


GGGAGGGTGT 


CTGGGGGTGT 


5100 


5 


GCGGTCCGGG CGGGCTGTGC CGGGTGGGGT GGGGGTGTTG 


TTCACGGGTC 


AGGGTGCGCA 


5160 




GTGGGTTGGT ATCGGGCGTG GGTTGTATGC GGGGGGTGGG 


GTGTTTGCGG 


AGGTGCTGGA 


5220" 




TCAGGTGTTG TCGATGGTGG GGGAGGTCGA TGGTCGGTCG 


TTGCGGGATC 


TGATGTTCGG 


5280 


10 


CGACGTCGAC GTCGACGCGG GTGCCGGGGC TGATGCGGGT 


GCCGGTGCGG 


GTGCTGGGGT 


5340 




CGGTTCTGGT TCCGGTTCTG TGGGTGGGTT GTTGGGTCGG 


ACGGAGTTTG 


CTCAGCCTGC 


5400 




GCTGTTTGCG TTGGAGGTGG CGTTGTTCCG GGCGTTGGAG 


GCTCGGGGTG 


TGGAGGTGTC 


5460 


15 


GGTGGTGTTG GGTCATTCGG TGGGGGAGGT GGCTGCTGCG 


TATGTGGCGG 


GGGTGTTGTC 


5520 




GTTGGGTGAT GCGGTGCGGT TGGTGGTGGC GCGGGGTGGG 


TTGATGGGTG 


GGTTGCCGGT 


5580 




GGGTGGGGGG ATOTGGTCGG TGGGGGCGTC GGAGTCGGTG 


GTGCGGGGGG 


TTGTTGAGGG 


5640 


20 


GTTGGGGGAG TGGGTGTCGG TTGCGGCGGT GAATGGGCCG 


CGGTCGGTGG 


TGTTGTCGGG 


5700 




TGATGTGGGT GTGCTCGAGT CGGTGGTTGC CTCGCTGATG 


GGGGATGGGG 


TGGAGTGCCG 


5760 


25 


GCGGTTGGAT GTGTCGCATG GGTTTCATTC GGTGTTGATG 


GAGCCGGTGT 


TGGGGGAGTT 


5820 


CCGGGGGGTT GTGGAGTCGT TGGAGTTCGG TCGGGTGCGG 


CCGGGTGTGG 


TGGTGGTGTC 


5880 




GGGTGTGTCG GGTGGGGTGG TGGGTTCGGG GGAGTTGGGG 


GATCCGGGGT 


ATTGGGTGCG 


5940. 


30 


TCATGCGCGG GAGGCGGTGC GTTTCGCGGA TGGGGTGGGG 


GTGGTGCGTG 


GTCTGGGTGT 


6000 




GGGGACGTTG GTGGAGGTGG GTCCGCATGG GGTGCTGACG 


GGGATGGCGG 


GTGAGTGCCT 


6060 




GGGGGCCGGT GATGATGTGG TGGTGGTGCC GGCGATGCGG 


CGGGGCCGTG 


CGGAGCGGGA 


6120 


35 


GGTGTTCGAG GCGGCGCTGG CGACGGTCTT CACCCGGGAC 


GCCGGCCTGG 


ACGCCACGGC 


6180 




ACTCCACACC GGGAGCACCG GCCGGCGCAT CGACCTCCCC 


ACCTACCCCT 


TCCAACGCGA 


6240 


40 


CCGCTACTGG CTGGACCCCG TTCGCACCGC CGTGACCGGC 


GTCGAGCCCG 


CCGGCTCGCC 


6300 


GGCGGACGCT CGGGCCACTG AGCGGGGACG GTCGACGACG 


GCCGGGATCC 


GCTACCGCGT 


6360 




CGCTTGGCAG CCGGCCGTCG TCGACCGCGG CAACCCCGGG 


CCTGCCGGTC 


ATGTGCTGCT 


6420 


45 


TCTGGCCCCG GACGAGGACA CGGCCGACTC CGGACTCGCC 


CCCGCGATCG 


CACGTCAACT 


6480 




CGCCGTGCGC GGGGCCGAGG TCCACACCGT CGCCGTGCCG 

• 


GTCGGTACAG 


GCCGGGAGGC 


6540 




AGCCGGGGAC CTCTTGCGGG CCGCCGGTGA CGGTGCCGCC 


CGCAGCACCC 


GAGTTCTGTG 


6600 


50 


GCTCGCCCCG GCCGAGCCGG ACGCGGCCGA CGCCGTCGCC 


CTCGTCCAGG 


CGCTGGGCGA 


6660 




GGCGGTACCC GAAGCCCCGC TCTGGATCAC CACCCGTGAG 


GCGGCGGCCG 


TGCGGCCGGA 


6720 




CGAGACCCCT TCCGTCGGGG GCGCTCAGCT GTGGGGACTC 


GGACAGGTCG 


CCGCGCTCGA 


6780 


55 


ACTGGGGCGG CGCTGGGGCG GCTTGGCGGA CCTCCCCGGG 


AGTGCGTCGC 


CCGCGGTCCT 


6840 
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CCGTACGTTC GTCGGGGCGC TGCTCGCCGG 
s CGGCGTCCAT GTCCGCCGTG TGGTTCCCGC 

CGTCACCACG GCCCCCGCCA CCGCCGTCGG 
GGTCGTGCCG GACGACCGGT GGTCCTCCGG 
10 CCTGGGTGCG CAGGTCGCCC GCAGGCTCGC 

GGGCCGGCGC GGCGCGGCCG GCCCCGGAGT 
CGGTTCCGAA GTGGCCGTCG AGGCCTGCGA 
15 CCTCGCGGGC CTCCCCGAGG AGCGGCCCCT 

CGACGACGGT GTGCTCGACT CGCTCACCTC 
GGTCACCGCC GCCCGTCACC TGGACGAGCT 

20 

GCTCTTCTCC TCCATCGTCG GCGTGTGGGG 
CAACGCCGCG CTCGACGCCC TGGCGCAGCG 
GATCGCCTGG GGGCCGTGGG CCGGTGCCGG 

25 

CGAACGGGAC GGCGTCACGG CCCTGGACCC 
GGTGGGCGCC GGCGGGACCT CTGCCGCAGG 
30 CGTCGCCGAC GTGGACTGGG AGACCTTCGT 

GCTCTTCGAC GGCGTCTCCG CCGCCCGTTC 
TGCCGCTCTC ACCCCAGGGA CGCGGCCGGG 
CGGGGGCGAG GGCCGGCCGT GGCTCTCCGT 
TCTGCTCACG CTTGTGCGCT CGGAGGCCGC 
GGTCGACCCG GAGCTGGCCT TCCGGTCCGC 

40 

GCGTAACCGC CTGACCGCTG CCACCGGCCT 
CCCGACCCCC CTCTCGCTCG CCTCCCACCT 

45 GGCGGAGCCG GCAGCGGCCG CCCCCACGCC 

GATCGTGGGC ATGGCGTGCC GTTACCCGGG 
CCTGGTGGCC GGTGACGGGC ACACGCTCTC 

50 CGAGGGGCTG TACGACCCGG AGCCGGGGGT 

GTTCCTGCGT TCCGCGGCCG AGTTCGACGC 
CACGGCCATC GACCCGCAGC AGCGGTTGCT 

55 

GGCCGGCATC GTTCCGGACT CGCTGCGCGG 



GGGAGAGAAC CAGTTCGCGG TACGGCCCTC 6900 

GCCCGTCCCC GTCCCGGCCT CCGCTCGCAC 6960 

CGAGGACGCA CGGAACGACA CCTCGGACGT 7020 

CACCGTACTG ATCACCGGGG GCACCGGTGC 7080 

CCGGTCGGGC GCCGCGCGTC TGCTCCTGGT 7140 

GGGCG AACTC GTCGAGGAGC TGACGGCGCT 7200. 

CGTCGCCGAC CGGGACGCAC TGGCCGCGCT 7260 

CGTCGCCGTA CTGCACGCGG CAGGTGTGCT 7320 

CGACCGGGTG GACGCCGTAC TCCGGGACAA 7380 

GACCGCGGAC CTTCCGCTCG ACGCCTTCGT 7440 

CAACGGAGGG CAGGCCGTCT ACGCGGCCGC 7500 

GCGCCGGGCC AGGGGAGCCC GTGCCGCCTC 7560 

AATGGCCTCC GGAACGGCGG CGAAGTCCTT 7620 

CGAGCGCGCG CTCGACGTCC TCGACGACGT 7680 

GACGCACGCG GCCGGCGAGA GCTCCCTGCT 7740 

CGGGCGTTCG GTCACCCGCC GTACCTGGTC 7800 

GGCGCGTGCC GGCCATGCCG CGGACGACCG 7860 

CGACGGCGCA CCGGGCGGGA GCGGACAGGA 7920 

CGGCCCCTCG CCGGCGGAAC GCCGTCGTGC 7980 

CGGGATCCTG CGCCACGCCT CGGCCGACGC 8040 

CGGGTTCGAC TCCCTCACCG TTCTCGAACT 8100 

GAACCTGCCG AACACGCTGC TCTTCGACCA 8160 

GCACGACGAA CTGTTCGGTC CCGACAGCGA 8220 

GGTCATGGCC GACGAGCGTG AGCCGATCGC 8280 

CGGTGTGGCG TCGCCGGACG ACCTGTGGGA 8340 

CCCGTTCCCG GCCGACCGTC GCTGGGACGT 8400 

GCCGGGCAAG AGCTATGTAC GGGAAGGCGG 8460 

GGAGTTCTTC GGGATATCGC CGCGCGAGGC 8520 

GCTGGAGACG TCGTOGGAGG CGCTGGAGCG 8580 

CACCCGGACC GGTGTCTTCA GCGGCATCTC 8640 
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CCAGCAGGAC TACGCGACCC AGCTCGGGGA 
CACGGGGACC CTCGGCAGTG TCATCTCCGG 

5 

GCCGGCGCTG ACGGTGGACA CGGCGTGTTC 
GCAGTCGTTG CGGCGGGGTG AGTGTGATCT 

r0 GACGCCGACG GTGTTCGTGG AGTTCTCGCG 

CAAGGCGTTC GCGGAGGGTG CGGACGGGAC 
GGTGGAGCGG CTTTCCGACG CGCGCCGCAA 

75 CAGTGCGGTC AATCAGGACG GTGCGAGCAA 

GCAGCGGGTG ATCCGTGAGG CGCTGGCTGA 
GGTGGAGGCG CACGGTACGG GGACGGCGCT 

20 

GGCCACGTAC GGGCGGGAGC GGGTCGGCGA 
CATCGGGCAT GCGCAGGCGG CTGCGGGTGT 
GCGGCATGGG TCGTTGCCGC GGACGCTGCA 

25 

GGCTTCGGGT GCGGTGGAGC TGCTGACCGA 
GGTGCGGCGG GCCGCGGTGT CGGCGTTCGG 

30 GGAGGAAGCG CCGGCGGAGG CCGGGAGCGA 

CGACGCGGTG ACGGGTCCGT TGTCGTGGGT 
GGCGCAGGCG GTGCGGTTGC GTGAGTGTGT 

35 GGCGGGGTCG TTGGTGGTGT CGCGTGCGTC 

GGGGCGTGAG GAGTTGCTGG CGGGTCTGGA 
GTCTTCGGGG GCCGGTGCTG TGGTGCGGGG 

40 

GTTGTTCACG GGTCAGGGTG CGCAGTGGGT 
TGGGGTGTTT GCGGAGGTGC TGGATGAGGT 
GTCGTTGCGG GATGTGATGT TCGCGGATGC 

45 

GACGGAGTTT GCTCAGCCTG CGTTGTTTGC 
GGCTCGGGGT GTGGAGGTGT CGGTGGTGTT 
50 GTATGTGGCG GGGGTGTTGT CGTTGGGTGA 

GTTGATGGGT GGGTTGCCGG TCGGTGGGGG 
GGTGCGGGGG GTTGTTGAGG GGTTGGGGGA 

55 

GCGGTCGGTG GTGTTGTCGG GTGATGTGGG 



CGCCGCCGAC ACCTACGGCG GGCATGTGCT 8700 

TCGGGTTGCC TATGCGTTGG GGTTGGAGGG 8760 

GTCGTCGTTG GTGGCGTTGC ATCTGGCGGT 8820 

GGCGTTGGCC GGTGGGGTGA CGGTGATGGC 8880 

GCAGCGGGGG CTGGCGGCGG ACGGGCGGTG 8940 

GGCGTGGGCG GAGGGTGTGG GTGTGCTGCT 9000 

CGGTCATCGG GTGCTGGCGG TGGTGCGGGG 9060 

TGGGCTGACG GCGCCGAGTG GTCCGGCGCA 9120 

TGCGGGGCTG GTGCCCGCCG ACGTGGATGT 9180 

GGG TGATCCG ATCGAGGCGG GTGCGCTGCT 9240 

TCCGTTGTGG CTCGGGTCGT TGAAGTCGAA 9300 

GGGTGGTGTG ATCAAGGTGG TGCAGGGGAT 9360 

TGTGGATGCG CCGTCGTCGA AGGTGGAGTG 9420 

GACCCGGTCG TGGCCGCGGC GGGTGGAGCG 9480 

GGTGAGCGGG ACCAACGCCC ATGTGGTCCT 9540 

GCACGGGGAC GGCCCTGAAC CTGAGCGGCC 9600 

GCTTTCTGCG CGGTCGGAGG GGGCGTTGCG 9660 

GGAGCGGGTG GGTGCGGATC CGCGGGATGT 9720 

GTTCGGTGAG CGTGCGGTGG TGCTGGGCCG 9780 

TGTGGTGGCT GCCGGGGCTC CTGTGGGTGT 9840 

GAGTGCGGTG CGGGGTCGTG GGGTGGGGGT 9900 

TGGTATGGGG CGTGGGTTGT ATGCGGGGGG 9960" 

GTTGTCGGTG GTGGGGGAGG TGGATGGTCG 10020 

TGACTCGGTT TTGGGTGGGT TGTTGGGTCG 10080 

GTTGGAGGTG GCGTTGTTCC GGGCGTTGGA 10140 

GGGTCATTCG GTGGGGGAGG TGGCTGCTGC 10200 

TGCGGTGCGG TTGGTGGTGG CGCGGGGTGG 10260 

GATGTGGTCG GTGGGGGCGT CGGAGTCGGT 10320 

GTGGGTGTCG GTTGCGGCGG TGAATGGGCC 10380 

TGTGCTGGAG TCGGTGGTTG TCACGCTGAT 10440 



115 



EP 0 791 655 A2 



GGGGGATGGG GTGGAGTGCC GGCGGTTGGA TGTGTCGCAT GGGTTTCATT CGGTGTTGAT 10500 

OGAGCCGGTG TTGGGGGAGT TCCGGGGGGT TGTGGAGTCG TTGGAGTTCG GTCGGGTGCG 10560 

GCCGGGTCTC GTGGTGGTGT CGGGTCTGTC GGGTGGGGTG GTGGGTTCGG GGGACTTGGG 10620 

GGATCCGGGG TATTGGGTGC GTCATGCGCG GGAGGCGGTG CGTTTCGCGG ATGGGGTGGG 10680 

GGTGGTGCGT GGTCTGGGTC TGGGGACGTT GGTGGAGGTG GGTCCGCATG GGGTGCTGAC 10740 

GGGGATGGCG GGTCAGTGCC TGGAGGCCGG TGATGATGTG GTGGTGGTGC CGGCGATOCG 10800 

GCGGGGCCGT CCGGAGCGGG AGGTGTTCGA GGCGGCGCTG GCGACGGTGT TCACCCGGGA 10860 

CGCCGGCCTC GACGCCACGA CACTCCACAC CGGGAGCACC GGCCGACGCA TCGACCTCCC 10920 

CACCTACCCC TTCCAACACA ACCGCTACTG GGCAACCGGC TCAGTGACCG GTGCGACCGG 10980 

CACCTCGGCA GCCGCGCGCT TCGGCCTGGA GTGGAAGGAC CACCCCTTCC TCAGCGGCGC 11040 

CACGCCGATA GCCGGCTCCG GCGCGCTGCT CCTCACCGGC AGGGTGGGGC TCGCTGCCCA 11100 

CCCGTGGCTG GCCGACCACG CCATCTCCGG CACGGTGCTG CTCCCCGGAA CGGCGATCGC 11160 

CGACCTGCTG CTGCGGGCGG TCGAGGAGGT CGGCGCCGGA GGGGTCGAGG AACTGACGCT 11220 

CCATGAGCCC CTGCTCCTCC CCGAGCGAGG CGGCCTGCAC GTCCAGGTGC TGGTCGAGGC 11280 

GGCCGACGAG CAGGGACGGC GTGCCGTGGC AGTCGCCGCA CGCCCGGAGG GCCCTGGGCG 11340 

GGACGGTGAG GAACAGGAGT GGACCCGGCA CGCGGAAGGC GTGCTCACCT CCACCGAGAC 11400 

GGCCGTTCCG GACATGGGCT GGGCCGCCGG GGCCTGGCCG CCGCCCGGTG CCGAGCCGAT 11460 

CGACGTCGAG GAGCTGTACG ACGCGTTCGC CGCGGACGGC TACGGCTACG GCCCGGCCTT 11520 

CACCGCACTG TCCGGCGTGT GGCGTCTCGG CGACGAACTC TTCGCCGAGG TGCGGCGGCC 11580 

CGCGGGGGGC GCGGGCACGA CCGGTGACGG TTTCGGCGTC CACCCCGCAC TCTTCGATGC 11640 

GGCCCTCCAC CCGTGGCGCG CCGGCGGGCT GCTGCCCGAC ACGGGCGGCA CCACCTGGGC 11700 

GCCGTTCTCC TGGCAGGGCA TCGCGCTCCA CACCACCGGA GCCGAGACGC TCCGCGTCAG 11760 

ACTGGCCCCT GCGGCCGGCG GCACCGAGTC GGCCTTCTCC GTACAGGCCG CCGACCCGGC 11820 

GGGCACCCCG GTCCTCACCC TCGACGCACT GCTGCTCCGC CCGGTGACCC TOGGGAGGGC 11880 

CGACGCGCCG CAACCGCTGT ACCGCGTCGA CTGGCAGCCG GTCGGCCAGG GGACCGAGGC 11940 

CTCCGGCGCC CAGGGCTGGA CGGTGCTCGG GCAGGCCGCG GCCGAGACGG TCGCGCAGCC 12000 

CGCCGCCCAT GCGGACCTCA CCGCCCTGCG TACGGCTGTG GCCGCGGCGG GAACACCCGT 12060 

GCCCCGGCTG GTGGTCGTGT CGCCGGTGGA CACCCGGCTG GACGAGGGGC CGGTGCTGGC 12120 

GGACGCCGAG GCTCGGGCCC GTCCGGGTGA CGGCTGGGAC GACGATCCCC TACGTGTCGC 12180 

CCTCGGGCGC GGCCTCACCC TGGTCCGGGA GTCGGTCGAG GACGAACGGT TGGCGGACTC 12240 
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CCGGCTCGTC GTCCTCACCC GTGGCGCGGT GGCGGCCGGT CCCGGCGATG TGCCGGACCT 12300 

GACAGGTCCG GCCCTGTGGG GGCTGCTCCG CTCCGCGCAG TCGCAGTATC CGGACCGCTT 123 60 

5 

CACCCTCATC GACGTGGACG ATTCCCCCGA GTCCCGTGCG GCTCTGCCCC GGGCTCTGGG 12420 

ATCGGCCGAG CGACAACTCG CCCTGCGGAC GGGCGACGTG CTGGCGCCGG CCCTGGTCCC 12480 

10 GATGGCCACC CGGCCGGCGG AGACCACTCC AGCGACGGCG GTCGCCTCGG CGACAACACA 12540 

GACACAGGTC ACCGCGCCCG CTCCCGACGA CCCGGCTGCG GATGCCGTGT TCGACCCGGC 12600 

GGGCACCGTA CTGATCACCG GCGGCACCGG CGCCCTGGGA CGGCGTGTCG CCTCGCACCT 12660 

15 CGCGCGCCGG TACGGCGTAC GCCACATGCT TCTGGTCAGC AGGCGTGGAC CGGACGCCCC 12720 

CGAGGCCGGT CCCCTGGAAC GGGAACTCGC CGGTCTCGGA GTCACCGCCA CCTTCCTGGC 12780 

ATCCGACCTC ACCGACATCG AGGCCGTACG GAAGGCCGTC GCCGCGGTGC CGTCGGACCA 12840 

20 

CCCGCTGACC GGTGTGGTGC ACACCGCCGG CGTGCTGGAC GACGGCGCCC TGACCGGCCT 12900 

GACCCGGCAA CGCCTCGACA CCGTGCTGCG GCCCAAGGCC GACGCCGTGC GGAACCTCCA 12960 

CGAGGCGACC CTCGACCGGC CGCTGCGCGC GTTCGTCCTG TTCTCCGCCG CCGCCGGACT 13020 

25 

CCTGGGCCGC CCCGGGCAGG CCTCCTACGC CGCCGCCAAC GCGGTCCTCG ACGCGCTCGC 13080 

GGGAGCCCGC CGCGCGGCCG GACTGCCCGC AGTGTCCCTG GCGTGGGGCC TGTGGGACGA 13140 

30 GCAGACGGGC ATGGCAGGAG GCCTCGACGA GATGGCCCTG CGCGTGCTGC GCCGGGACGG 13200 

CATCGCCGCG ATGCCTCCGG AGCAGGGGCT CGAACTGCTC GACCTGGCCC TGACCGGACA 13260 

CCGGGACGGA CCCGCCGTCC TCGTCCCCCT CCTCCTCGAC GGCGCGGCCC TGCGCCGCAC 13320 

35 GGCGAAGGAG CGCGGCGCGG CCACGATGTC CCCCTTGCTG CGCGCCCTGC TGCCCGCCGC 13380 

CCTGGGCCGC AGCGGTGGAG CCGGCGCCCC CGCGGCGGCC GACCGGCACG GCAAGGAGGC 13440 

GGACCCCGGT GCGGGACGCC TCGCAGGGAT GGTGGCACTC GAAGCGGCGG AGCGTTCCGC 13500 

40 

GGCCGTCCTT GAGCTGGTCA CCGAACAGGT CGCCGAGGTC CTCGGCTACG CGTCGGCCGC 13560 

GGAGATCGAG CCCGAACGAC CCTTCCGGGA GATCGGCGTC GACTCCCTGG CGGCGGTGGA 13620 

GCTGCGCAAC CGGCTCAGCC GTCTCGTCGG CCTGCGGTTG CCGACCACGC TGTCCTTCGA 13680 

45 

CCACCCCACG CCGAAGGACA TGGCGCAGCA CATCGACGGG CAGCTCCCCC GCCCGGCCGG 13740 

AGCCTCGCCC GCGGACGCAG CGCTGGAAGG GATCGGCGAC CTCGCGCGGG CGGTCGCCCT 13800 

50 GCTGGGCACG GGCGACGCCC GCCGGGCCGA GGTACGAGAG CAGCTCGTCG GACTGCTGGC 13860 

CGCGCTCGAC CCACCTGGGC GGACGGGCAC CGCCGCACCC GGCGTCCCCT CCGGTGCCGA 13920 

TGGCGCGGAA CCGACCGTCA CGGACCGGCT CGACGAGGCG ACCGACGACG AGATCTTCGC 13980 

CTTCCTGGAC GAGCAGCTGT GACCACACCG TGGACCGACC GCATGCCGAG GAGTTGGTGG 14040 
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CAGCAATGAC CGCCGAGAAC GACAAGATCC 
TGCACCGGAC CAAGTCCCGC CTGGCCGAGG 

5 

TCGTGGGCAT GGCGTCCCGT TACCCGGGCG 
TGGTGGCAGC CGGTACGGAC GCGGTCTCCG 

1Q AGGGGCTGTA CGACCCCGAT CCGGAGGCGG 

TCCTGCACTC GGCGGCCGAG TTCGACGCGG 
CGGCGATGGA TCCGCAGCAG CGGTTGCTGC 

15 CGGGGATCGT CCCCGCGTCG CTGCGCGGCA 

ACGACGACTA CGGGTCGCGG TTCGACTCGG 
ACGGCAGCGC CGGCAGCATC GCGTCCGGTC 

20 CGGCGCTCAC GGTGGACACG GCGTGTTCGT 

AGTCGTTGCG GCGGGGTGAG TGTGATCTGG 
CGCCGACGGT GCTCGTGGAG TTCTCGCGGC 

25 

AGGCGTTCGC GGAGGGTGCG GACGGGACGG 
TGGAGCGGCT CTCCGACGCC CGCCGCAATG 

30 GTGCGGTCAA TCAGGACGGT GCGAGCAACG 

AGCGGGTGAT CCGTCAGGCG CTGGCCGACG 
TCGAGGCGCA CGGCACCGGC ACACCCCTCG 

35 CCACCTATGG CAGTGAGCGC CAGGGCCAAG 

ACATCGGGCA TGCGCAGGCG GCTGCGGGTG 
TGCGGCATGG GTCGTTGCCG CGGACGCTGC 

40 

GGGCTTCGGG TGCGGTGGAG CTGCTGACCG 
GGGTGCGGCG GGCCGCGGTC TCGGCGTTCG 
TGGAGGAAGC GCCGGCGGAG GCCGGGAGCG 

45 

CCGACGCGGT GACGGGTCCG TTGTCGTGGG 
GGGCGCAGGC GGTGCGGTTG CGTGAGTGTG 

50 TGGCGGGGTC GTTGGTGGTG TCGCGTGCGT 

GGGGGCGTGA GGAGTTGCTC GCGGGTCTGG 
TGTCCGGGGG CGTGTCTTCG GGGGCCGGTG 

55 GTGGGGTGGG GGTGTTGTTC ACGGGTCAGG 



GCAGCTACCT GAAGCGTGCC ACCGCCGAAC 14100 

TCGAGTCGGC GAGCCGCGAG CCGATCGCGA 14160 

GTGTCGCGTC GCCGGACGAC CTGTGGGACC 14220 

CGTTCCCCGT CGACCGTGGC TGGGACGTCG 14280 

TGGGGCGTAG TTACGTGCGG GAGGGCGGGT 14340 

AGTTCTTCGG GATCTCGCCC CGTGAGGCGG 14400 

TGGAGACGTC GTGGGAGGCG CTGGAGCGGG 14460 

CCCGTACCGG CGTCTTCACC GGCGTCATGT 14520 

CTCCGCCGGA GTACGAGGGC TACCTCGTGA 14580 

GGGTTGCCTA TGCGTTGGGG TTGGAGGGGC 14640 

CGTCGTTGGT GGCGTTGCAT CTGGCGGTGC 14700 

CGTTGGCCGG TGGGGTGACG GTGATGGCGA 14760 

AGCGGGGGCT GGCGGCGGAC GGGCGGTGCA 14820 

CGTGGGCCGA GGGTGTGGGC GTGCTGCTGG 14880 

GCCATCGGGT GCTGGCGGTG GTGCGGGGCA 14940 

GGCTGACGGC GCCGAGTGGT CCTGCGCAGC 15000 

CGGGGCTGAC GCCCGCCGAC GTCGACGCGG 15060 

GCGACCCCAT CGAGGCGGGT GCGTTGCTOG 15120 

GTCCGTTGTG GTTCGGGTCG TTGAAGTCGA 15180 

TGGGTCGCGT GATCAAGGTG GTGCAGGCGA 15240* 

ATGTGGATGC GCCGTCGTCG AAGGTX5GAGT 15300 

AGACCCGGTC GTGGCCGCGG CGGGTGGAGC 15360 

GGGTGAGCGG GACCAACGCC CATGTGGTCC 15420 

AGCACGGGGA CGGCCCTGAA CCCGAGCGGC 15480 

TCCTTTCTGC GCGGTCGGAG GGGGCGTTGC 15540 

TGGAGCGGGT GGGTGCGGAT CCGCGGGATG 15600 

CGTTCGGTGA GCGTGCGGTG GTGGTCGGCC 15660 

ATGTGGTGGC TGCCGGGGCT CCTGTGGGTG 15720 

CTGTGGTGCG GGGGAGTGCG GTGCGGGGTC 15780 

GTGCGCAGTG GGTTGGTATG GGGCGTGGGT 15840 
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TGTATGCGGG 


GGGTGGGGTG 


TTTGCGGAGG 


TGCTGGATGA 


GGTGTTGTCG GTGGTGGGGG 


15900 


AGGTGGGGGG 


TTGGTCGTTG 


CGGGATGTGA TGTTCGGCGA 


CGTCGACGTG GACGCGGGTG 


15960 


CCGGGGCTGA 


TGCGGGTGTC 


GGTTCGGGTC 


TTGGTGTGGG 


TGGGTTCTTG GGTCGGACGG 


16020 


AGTTTGCTCA 


GCCTGCGTTG 


TTTGCGTTGG AGGTGGCGTT 


GTTCCGGGCG TTCGAGGCTC 


16080 


GGGGTGTGGA 


GGTGTCGGTG 


GTGTTGGGTC 


ATTCGGTGGG 


GGAGGTGGCT GCTGCGTATG 


16140 


TGGCGGGGGT 


GTTGTCGTTG 


GGTGATGCGG 


TGCGGTTGGT 


GGTGGCGCGG GGTGGGTTGA 


16200 


TGGGTCGGTT 


GCCGGTCGGT 


GGGGGGATGT GGTCGGTGGG 


GGCGTCGGAG TCGGTGGTGC 


16260 


GGGGGGTTGT 


TGAGGGGTTG 


GGGGAGTGGG 


TGTCGGTTGC 


GGCGGTGAAT GGGCCGCGGT 


16320 


CGGTGGTGTT 


GTCGGGTGAT 


GTGGGTGTGC 


TGGAGTCGGT 


GGTTGCCTCG CTGATGGGGG 


16380 


ATGGGGTCGA 


GTGCCGGCGG 


TTGGATGTGT 


CGCATGGGTT 


TCATTCGGTG TTGATGGAGC 


16440 


CGGTGTTGGG 


GGAGTTCCGG 


GGGGTTGTGG 


AGTCGTTGGA 


GTTCGGTCGG GTGCGGCCGG 


16500 


GTGTGGTGGT 


GGTGTCGAGT 




GGGTGGTGGG 


TTCGGGGGAG TTGGGGGATC 


16560 


CGGGGTATTG 


GGTGCGTCAT 


GCGCGGGAGG 


CGGTGCGTTT 


CGCGGATGGG GTGGGGGTGG 


16620 


TGCGTGGTCT 


GGGTGTGGGG 


ACGTTGGTGG 


AGGTGGGTCC 


GCATGGGGTG CTGACGGGGA 


16680 


TGGCGGGTGA 


GTGCCTGGGG 


GCCGGTGATG 


ATCTCGTGGT 


GGTGCCGGCG ATGCGGCGGG 


16740 


GCCGTCCGGA 


GCGGGAGGTG 


TTCGAGGCGG 


CGCTGGCGAC 


GGTGTTCACC CGGGACGCCX5 


16800 


GCCTGGACGC 


CACGACACTC 


CACACCGGGA GCACCGGCCG 


ACGCATCGAC CTCCCCACCT 


16860 


ACCCCTTCCA 


ACACGACCGC 


TACTGGCTGG 


CCGCCCCGTC 


CCGGCCCAGG ACGGACGGGC 


16920 


TGTCGGCGGC 


GGGTCTGCGC 


GAGGTGGAGC 


ACCCCCTGCT 


CACCGCCGCC GTGGAACTGC 


16980 


CCGGCACCGA 


CACCGAGGTG 


TGGACCGGCC 


GCATATCCGC 


TGCCGACCTG CCCTGGCTCG 


17040 


CCGACCACCT 


GGTGTGGGAC 


CGAGGCGTGG 


TGCCGGGGAC 


CGCGCTGCTG GAGACGGTGC 


17100 


TCCAGGTGGG 


AAGCCGGATC 


GGTCTGCCGC 


GCGTCGCCGA 


ACTGGTCCTC GAGACGCCGC 


17160 


TGACCTGGAC 


GTCGGACCGC 


CCGCTCCAGG 


TCCGGATCGT 


CGTGACCGCT GCCGCCACCG 


17220 


CCCCCGGGGG 


CGCGCGTGAG 


CTGACCCTCC ACTCGCGGCC 


CGAGCCCGTG GCCGCCTCCT 


17280 



CGTCCTCCCC GAGTCCCGCC TCTCCCCGGC ACCTCACGGC GCAGGAGAGC GACGACGACT 17340 

GGACCCGGCA TGCCTCAGGG CTGCTCGCCC CGGCTGCCGG CCTCGCCGAC GACTTCGCCG 17400 

50 AGCTCACCGG CGCCTGGCCC CCCGTCGGCG CCGAGCCCCT CGACCTCGCC GGTCAGTACC 17460 

CGCTCTTCGC AGCCGCCGGA GTGCGCTACG AAGGCGCCTT CCGAGGGCTC CGCGCGGCAT 17520 

GGCGTCGAGG CGACGAGGTC TTCGCCGACG TACGGCTGCC CGACGCGCAC GCGGTCGACG 17580 

CTGATCGTTA CGGGGTGCAC CCCGCCCTGC TCGACGCGGT GCTCCACCCG ATCGCGTCGC 17640 
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TGGACCCGCT GGGCGACGGC GGGCACGGTC TGCTGCCGTT CTCCTGGACC GACGTACAGG 17700 
GACACGGCGC CGGCGGACAC GCCCTCCGGG TACGGGTGGC GGCCGTCGAC GGCGGCGCGG 17760 
TGTCGGTCAC CGCGGCCGAC CACGCGGGCA ACCCGGTGTT ATCCGCCCGG TCCCTGGCAC 17820 
TGCGTCGTAT CACCGCGGAC CGGCTTCCCG CCGCGCCCGT CGCCCCTCTC TACCGCGTGG 17880 
ACTGGCTGCC GTTCCCGGGT CCGGTGCCCG TATCCGCGGG CGGCCGCTGG GCGGTCGTCG 17940 
GACCCGAGGC CGAAGCCACG GCTGCCGGAC TGCGTGCGGT GGGCCTCGAC GTGCGTACCC 18000 
ATGCGCTCCC CCTCGGAGAG CCCCTGCCTC CGCAGGCCGG TACCGACGCG GAGGTGATCA 18060 
TCCTCGACCT GACCACCACC GCAGCCGGCC GTACGGCGTC GGACGGGGGG CGGCTCAGTC 18120 
TCCTCGACGA GGTGCGTGCG ACGGTCCGCC GGACCCTCGA AGCCGTACAG GCCCGCCTCG 18180 
CCGACACCGA AACGGCCCCC GACGTCGACG TCCGTACGGC CGCGCGCCCC CGCACAGCCG 18240 
CCCGTACAAG CCCCCGCGTG GACACCCGCA CGGGAGCCCG CACCGCTGAC GGCCCCCGGC 18300 

TCGTCGTCCT GACCCGGGGC GCGGCCGGAC CCGAGGGAGG CGCGGCCGAT CCCGCGGGTG 18360 

CCGCTGTCTG GGGGCTCGTC CGGGTCGCCC AGGCCGAACA GCCCGGCCGC TTCACCCTGG 18420 

TGGACGTCGA CGGCACCCAG GCGTCGCTGC GGGCCCTCCC CGGTCTCCTG GCCACGGATG 18480 

CCGGCCAGTC GGCCGTGCGC GACGGACGTC TCACCGTCCC GCGCCTCGTC CCGGTGGCCG 18540 

ACCCCGTCCC CCACGGCGGC GGCACGGCGG CCGACGGGAC GGGTGCCGGC GAGCCGTCCG 18600 

CGACCCTGGA CCCCGAAGGC ACCGTGCTGA TCACCGGCGG CACCGGAGCA CTGGCCGCGG 18660 
3 AAACCGCCCG GCACCTGGTC GACCGGCACA AGGTGCGCCA TCTCCTGCTC GTGGGCAGGC 18720 

GCGGTCCCGA CGCACCCGGC GTCGATCGAC TGGTCGCCGA GTTGACCGAG TCGGGTGCCG 18780 

AGGTCGCCGT ACGGGCCTGT GACGTCACGG ACCGCGACGC CCTGCGCCGC CTCCTCGACG 18840 

CACTCCCCGA CGAACACCCG CPGACCTGCG TGGTGCACAC CGCCGGGGTG CTCGACGACG 18900 

GCGTGCTCTC CGCCCAGACG GCCGAGCGGA TCGACACGGT GCTCCGGCCC AAGGCCGACG 18960 

CCGCCGTCCA CCTGGACGAG CTGACCCGGG AGATCGGACG GGTGCCCCTG GTGCTGTACT 19020 

CCTCGGTCTC GGCCACCCTG GGCAGCGCGG GGCAGGCCGG GTACGCGGCG GCCAACGCCT 19080 

TCATGGACGC GCTGGCCGCC CGGCGGTGCG CCGCCGGGCA CCCCGCGCTG TCGCTCGGCT 19140 

GGGGCTGGTG GTCCGGGGTG GGTCTCGCCA CCGGACTGGA CGGAGCGGAC GCGGCGCGGG 19200 

TCAGGCGCTC GGGTCTCGCC CCGCTCGACG CCGGCGCCGC ACTGGACCTG CTCGACCGGG 19260 

CGCTGACCCG GCCCGAGCCG GCCCTCCTCC CCGTGCGGCT CGACCTCCGC GCCGCGGCCG 19320 

GTGCCACCGC TCTCCCGGAG GTCCTGCGTG ACCTGGCCGG CGTACCGGCG GACGCCCGCA 19380 

GCACGCCCGG GGCCGCGGCG GGCACCGGGG ACGAGGACGG TGCCGTCCGC CCTCCCCCCG 19440 
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CCCCGGCCGA CGCCGCCGGG ACGCTGGCCG CGCGGCTCGC GGGACGTTCC GCACCCGAGC 19500 

GTACGGCTCT CCTGCTCGAC CTGGTGCGGA CCGAGGTCGC GGCGGTGCTC GGACACGGCG 19560 

5 

ACCCCGCCGC GATCGGCGCC GCCCGCACCT TCAAGGACGC CGGATTCGAC TCCCTCACCG 19620 

CTCTCGACCT CCGCAACCGG CTGAACACAC GCACCGGACT GCGGCTGCCC GCGACCCTCG 19680 

1Q TCTTCGACCA CCCCACACCG CTCGCCCTCG CCGAACTCCT GCTCGACGGG CTGGAGGCGG 19740 

CCGGTCCAGC GGAACCGGCC GCTGAGGTCC CGGACGAAGC GGCCGGTGCC GAGACCCTGT 19800 

CCGGCGTGAT CGACCGGCTG GAACGCAGCC TCGCCGCGAC CGACGACGGC GACGCCCGGG 19860 

15 TCCGCGCGGC ACGGCGGCTG CGCGGCCTGC TGGACGCGCT CCCCGCCGGT CCCGGTGCCG 19920 

CGTCCGGTCC GGATGCCGGA GAGCACGCCC CCGGTCGCGG CGACGTGGTG ATCGACCGGC 19980 

TCAGGTCGGC CTCCGACGAC GACTTGTTCG ACCTGCTCGA CAGCGACTTC CAGTGAGCCG 20040 

20 GACCGCGCCG CGCGCCGACC GCTGAACCGC TCTTCACCCA GACCCACGAG ACCACGCCTG 20100 

AGGAGAACCG TGTCTGCGAC CAACGAGGAG AAGTTGCGGG AGTACCTCCG GCGCGCGATG 20160 

GCCGACCTGC ACAGCGCACG AGAGCGGTTG CGCGAGGTCG AGTCGGCGAG CCGTGAGCCG 20220 

25 

ATCGCGATCG TGGGCATGGC GTGCCGTTAC CCGGGCGGTG TGGCGTCGCC GGAGGAGCTG 20280 

TGGGACCTGG TGGCCGCCGG TACGGACGCG ATCTCCCCGT TCCCCGTCGA CCGCGGCTGG 20340 

3Q GACGCCGAGG GTCTGTACGA CCCGGAGCCG GGGGTGCCGG GCAAGAGCTA CGTGCGCGAG 20400 

GGCGGGTTCC TCCACTCGGC GGCCGAGTTC GACGCGGAGT TCTTCGGGAT CTCGCCGCGT 20460 

GAGGCGGCGG CGATGGATCC GCAGCAGCGG TTGCTGCTGG AGACGTCGTG GGAGGCGCTG 20520 

35 GAGCGGGCCG GGATCGTCCC CGCGTCGCTG CGCGGCACCC GTACCGGCGT CTTCACCGGC 20580 

GTCATGTACC ACGACTACGG CAGCCACCAG GTCGGCACCG CCGCCGATCC CAGTGGACAG 20640 

CTCGGCCTCG GCACCGCGGG GAGCGTCGCC TCGGGCCGGG TGGCGTACAC CCTCGGTCTA 20700 

40 

CAGGGGCCGG CCGTGACCAT GGACACGGCA TGCTCGTCCT CGCTGGTGGC GTTCCACCTG 20760 

GCGGTGCAGT CGTTCCGGCG GGGCGAGTGC GATCTCGCGT TGGCCGGCCG GGCGACGGTC 20820 

TTGGCGACGC CCACGGTGTT CGTGGAGTTC TCGCGGCAAC GGGGGCTGGC GGCGGACGGA 20880 

45 

CGGTGCAAGG CGTTCGCGGA GGGCGCCGAC GGCACGGCGT GGGCCGAGGG CGCCGGTGTG 20940 

CTGCTGGTGG AGCGGCTCTC CGACGCCCGC CGCAACGGCC ATCGGGTGCT CGCGGTGGTG 21000 

50 CGGGGCAGCG CGGTCAACCA GGACGGTGCC AGCAACGGCC TCACCGCACC CAGCGGGCCC 21060 

GCCCAGCAGC GGGTGATCCG TGACGCGCTG GCCGACGCGG GGCTGACGCC CGCCGACGTG 21120 

GACGCGGTCG AGGCGCACGG CACCGGCACA CCGCTCGGCG ACCCGATCGA GGCCGGCGCG 21180 

55 CTGATGGCCA CCTACGGCAG TGAACGGGTG GGCGACCCGC TGTGGCTGGG TTCGCTGAAG 21240 
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TCGAACATCG GACACACCCA GGCCGCCGCC 
GCGTTACGGC AGTCCGAGCT GCCGCGCACC 

5 

GAATGGGACG CGGGCGCCGT GC AACTGCTC 
CACAGGCCCC GGCGGGCCGC GGTCTCCGCC 

10 ATCATCGAGG AACCGCCCGC GGCCGGTGAC 

GGCGAGGCGA CCGCGTCCCC CTCCACCGCG 
GGGCCGCTGT CCCCCTCCTC CCCGGCCGTC 

15 GCCCTGCGCG CCCAGGCCGC CCGCCTGCGG 

CCGACCGACA TCGGCCACGC CCTGGCCGCC 
CTGCTCGGCG ACGACGGAGC CCCGGTCGAC 

20 

ACCCCCGACG CCGTCCACGG CACCGCGGCG 
GGCCAGGGTT CCCAGTGGGC CGGGATGGGC 
GCCGCCGAAC TGGACCGCTG CCAGGGCGCG 

25 

GACGTGCTGC GCGGCGCGCC CGCGGCGCCC 
GCCACCTTCG CCGTCATGGT GGGACTCGCC 

30 GCCGCCGTCA TCGGCCACTC CCAGGGCGAG 

TCCCTGGAGG ACGCCGCCCG GATCGTGGCC 
GCCGGGCGGG GCGGCATGGC CTCGGTGGCC 

35 GCCGGCGGCG TCGAGATCGC CGCCGTCAAC 

CCCGGCGCCC TGGAGGCGTT GCTCGTCACG 
ATCGACGTCG ACTACGCGTC CCACTCCCAC 

40 

ACCGTCCTCG GCCCCGTCCG GCCGCGGAGG 
GCGGCGCTCC TCGACACCGC CACCCTGGAC 
CCGGTGCGCT TCGAGCCGAC CGTACGCGCC 

45 

GAGTGCTCCG CGCATCCCGT CCTGACCGTC 
GGCGCGGTCC CGGCCCTCGC TTCGCTGCGC 
50 ACCTCCGCCG CCGAGGCCCA GGTCGTCGGC 

GGCGCCGGCC GGGTGGACCT GCCGACCTAC 
CCCGCCCGGC CCGACTCCGC GGCGACGGCC 
GGAGACCGGC TCGGCTACCA CGTCGCGTGG 



GGAGCCGCCG GCGTCATCAA GATGGTGCAG 21300 

CTGCACGTCG ACGCGCCCTC GGCCAAGGTC 21360 

ACCGGCGTCC GGCCATGGCC CCGGCGCGAG 21420 

TTCGGCGTCA GCGGCACCAA CGCCCACGTC 21480 

ACCTCGCCCG CCGGCGACAC CCCTGAGCCG 21540 

GCCGGGCCGT CGTCCCCCTC CGCGGTGGCC 21600 

GTCTGGCCCC TGTCCGCCGA GACCGCCCCC 21660 

GCGCACCTCG AACGCCTCCC CGGCACCTCG 21720 

GAACGCGCCG CCCTCACCCG ACGCGTCGTG 21780 

GCACTCGCCG CCCTCGCCGC CGGCGAGACC 21840 

GACATCCGCC GGGTCGCCTT CGTGTTCCCC 21900 

GCCGAACTGC TGGACACGGC CCCGGCCTTC 21960 

CTCTCCCCGT ACGTGGACTG GAACCTCGCG 22020 

GGCCTCGACC GGGTCGACGT CGTCCAGCCG 22080 

GCGCTGTGGC GCTCCCTCGG GGTCGAACCC 22140 

ATCGCCGCGG CCTGCGTGGC GGGCGCGCTC 22200 

CTGCGCTCCC AGGTCATCGC CCGCGAACTG 22260 

CTGCCCGCGG CGGAGGTCGA GGCCCGCCTG 22320 

GGCCCCGGCT CGACCGTCGT CTGCGGAGAG 22380 

CTGGAGAGCG AAGGCACCCG GGTCCGCCGC 22440 

TACGTCGAGA GCATCCGGGC GGAACTCGCC 22500 

GGCGACGTGC CCTTCTACTC CACCGTCGAG 22560 

GCCGACTACT GGTACCGCAA CCTGCGCCTC 22620 

ATGCTCGACG ACGGCGTCGA CGCGTTCGTG 22680 

GGCGTGCGCC AGACCGTGGA GAGCGCCGGC 22740 

CGCGACGAGG GCGGGC TGCG GCGCTTCCTC 22800 

GTCCCCGTGG ACTGGGCGAC GCTCCGCCCA 22860 

GCCTTCCAGC GCGAACGCCA CTGGGTCGGC 22920 

GCCACGACCG GTGACGACGC CCCGGAGCCC 22980 

AAGGGACTGC GCTCCACCAC CGGCGGCTGG 23040 
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CGCCCCGGCC 

ACCCTGGAAC 

GCACGCACCG 

GCCGTCACCG 

GCCGTGCCCG 

AGCACCGCAC 

CCGTGCACCG 

GAGGTGTGGG 

CTCGCCGGCG 

GTCTTCGGCC 

CGGGGCACCG 

CTCCTCGAGG 

GCCGCCGCCG 

GCCGCACTGG 

CTGGTGCCCC 

GCCAAGACCA 

CTGGTGCTGT 

GCCGCCAGCG 

TTCTCCGTGG 

TTCCTCAGCC 

CGCATGCTGG 

GCCGCCTCCT 

CGGCTGCGCG 

CGCGAGCTGA 

GCACACGCCG 

TTCCGCGACC 

AGCACCGGGC 

GCCCGGCACC 

GACACCCCGG 

GGCGGGGTGC 



TGCGCCTGCT 

AGGCGGTCGC 

GACGCGCCGA 

GCGTCGTCTC 

TCGCCGTCAC 

CGCTGTGGAC 

CCGGTGCCCA 

GCGGCCTCAT 

TCCTCGCCGA 

GCCGGGTCCT 

TCCTCATCGC 

ACGGCGCGGA 

GACTGACCGG 

CCGCGCTGCT 

TGGCGCCGCT 

CGGCCGCCGG 

TCTCCTCGGT 

CGCACCTCGA 

CCTGGAGCCC 

GGCGCGGGCT 

AGCGCGGCAG 

ACACCTGGGT 

CGGCCGAACT 

CCGCGCAGTC 

CCGCCGTCCT 

TCGGCTTCGA 

TCAAACTGCC 

TCGGTGAGGA 

TACGGACGGA 

AGTCCCCCGA 



GATCGTGCCC ACCGGGGACC AGTACACCGC CCTCGCCGAC 23100 

CTCCTTCGGC GGAACGGTCC GCCGCGTCGC CTTCGACCCG 23160 

GCTGTTCGGC CTGCTCGAGA CGGAGATCAA CGGCGACACC 23220 

GCTGCTCGGA CTGTGCACCG ACGGCAGGCC GGACCACCCC 23280 

CGCCACCCTC GCCCTCGTCC AGGCCCTGGC CGACCTCGGC 23340 

CGTCACCTGC GGCGCGGTCG CCACCGCCCC CGACG AACTG 23400 

GCTGTGGGGC CTGGGCCGGG TGGCCGCGCT GGAGCTGCCC 23460 

CGACCTTCCC GCGCGGCCCG ACGCCCGGGT CCTGGACCGT 23520 

ACCCGGCGGC GAGGACCAGA TCGCCGTACG GATGGCGGGC 23580 

GCGGAACCCG GCCGACTCCC GGCCCCCGGC CTGGCGCGCC 23640 

CGGCGACCTC ACGACGGTGC CCGGCCGACT GGTCCGGTCC 23700 

CCGCGTGGTG CTGGCCGGAC CCGACGCCCC CGCACAGGCC 23760 

CGTCTCCCTC GTCCCCGTGC GCTGCGACGT CACCGACCGC 23820 

CGACGAGCAC GCGCCCACCG TCGCCGTGCA CGCCCCGGCC 23880 

GCGGGAGACG GCACCCGGCG ACATCGCCGC CGCCCTCGCC 23940 

CCACCTGGTC GACCTGGCGC CQGCCGCGGG CCTCGACGCG 24000 

CTCCGGAGTG TGGGGCGGCG CGGCCCAGGG CGGCTACGCG 24060 

CGCGCTCGCC GAACGCGCCC GCGCCGCGGG GGTGCCCGCG 24120 

CTGGGCCGGA GGCACGCCCG CCGACGGTGC CGAGGCGGAG 24180 

GGCTCCCCTC GACCCCGACC AGGCGGTGCG GACCCTGCGC 24240 

CGCCTGCGGT GCGGTCGCCG ACGTCGAGTG GAGCCGGTTC 24300 

GCGTCCCGCC GTACTCTTCG ACGACATCCC GGACGTGCAG 24360 

CGCCCCGAGC ACCGGAGACT CGACCACCTC CGAACTCGTC 24420 

CGGCCACAAG CGGCACGCCA CCCTGCTGCG GCTGGTGCGC 24480 

CGGACAGTCC TCCGGCGACG CGGTGAGCAG CGCCCGCGCC 24540 

CTCGCTGACC GCCCTCGAAC TGCGCGACCG GCTCAGCACC 24600 

CACCTCCCTG GTCTTCGACC ACTCCAGCCC GGCCGCGCTC 24660 

ACTCCTCGGC CGGAACGACA CCGCCGACCG GGCCGGCCCC 24720 

CGAGCCCATC GCCATCATCG GCATGGCCTG CCGGCTGCCC 24780 

GGACCTGTGG GACCTGCTGA CCGGTCGGAC CGACGCCATC 24840 
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ACCCCCTTCC CGACCAACCG GGGATGGGAC 
TCGCCCGGGC ACCACACCTA CGTGCGCGAG 

5 

GACCCCGGCT TCTTCGGCAT CAGCCCCCGC 
CTGATCCTGG AGACGTCCTC GGAGTCCTTC 

10 CGCGGCAGCC GCACCGGGGT CTTCGTCGGC 

CAGGACGGCG ACGAGAACTT CGACGGCTAC 
TCCGGCCGGC TCTCCTACGT CTTCGGACTG 

15 TGCTCGGCCT CCCTGGCCGC ACTGCACCTG 

GACTACGCCC TCGCCGGCGG GGCCACGGTG 
GCCCGTCAGC GAGCGGTGTC GCCGGACGGC 

20 

GGGGTCGGTC TCGCCGAGGG AGCCGGGATG 
AAGAAGGGCC ATCCGGTACT GGCGGTGGTG 
AGCAACGGCC TCACCGCACC CAGCGGGCCC 

25 

GCCGACGCGG GGCTGACGCC CGCCGACGTG 
CCGCTCGGCG ACCCCATCGA GGCCGGCGCG 

30 GACGGCCCGC TGTGGCTGGG TTCGCTGAAG 

GGCGTGGCCG GGGTGATCAA GATGGTGCTG 
CTGCACGCGT CGACGGCGTC GTCCAGGATC 

35 GACGAGGCCA GGCCCTGGCT CCAGCGGGCC 

TTCGGCATCA GCGGCACCAA CGCGCACCTC 
CCCGAACTGC TCGCGCCCGA ACCGGCCGCC 

40 

CACGAGGTGA CCGTGCCCCT GATGATGTCC 
GCGCGGCGCC TGCGCGCCGA CCTGCTCGCC 
TACACCCTCA TCACCACCCG CACCCGGTTC 

45 

TTCACGGAGC TGATCGCGGC CCTCGACGAC 
CTGCGGGGCA CCGCCGGCAC CTCCGACCAG 
50 CAGTGGCCCG AGATGGCCGA CGGGCTGCTG 

GAGACCGCCC GCGCCTGCGA CCTCGCGCTC 
GTACTGCGCC GGGAACCCGG CGCGCCCTCG 
CTGTTCACCA TGATGGTCTC GCTCGCCGAG 



AACGAGACCC TCTACGACCC CGACCCCGAC 24900 

GGCGGGTTCC TGCACGACGC GGCCGAGTTC 24960 

GAGGCCCTGG CCATGGACCC GCAGCAGCGG 25020 

GAACGGGCCG GCATCGACCC GGTCGAACTG 25080 

ACCAACGGAC AGCACTACGT GCCGCTCCTC 25140 

ATCGCCACCG GCAACTCCGC CAGCGTGATG 25200 

GAGGGCCCCG CCGTCACCGT CGACACCGCC 25260 

GCGGTGCAGT CACTGCGCCG CGGCGAATGC 25320 

ATGTCCACCC CCGAGATGCT GGTGGAGTTC 25380 

CGCAGCAAGG CGTTCGCGGA GGCGGCCGAC 25440 

CTGCTCGTGG AGCG g C TGTC GGAGGCGCAG 25500 

CGGGGCAGTG CCGTCAACCA GGACGGTGCC 25560 

GCCCAGCAGC GGGTGATACG GGAGGCGCTG 25620 

GACGCGGTCG AGGCGCACGG CACCGGCACG 25680 

CTGCTCGCCA CGTACGGCCG GGACCGGCGC 25740 

TCGAACATCG GGCACACCCA GGCCGCCGCC 25800 

GCGCTGCGCC ACGGCGAGCT GCCGCGCACC 25860 

GATTGGGACG CGGGCGCCGT GGAGTTGCTG 25920 

GAGGGGCCGC GCCGGGCGGG CATCTCCTCG 25980 

GTCATCGAGG AGCCGCCGGA GCCCACCGCG 26040 

GACGGCGACG TCTGGTCCGA GGAGTGGTGG 26100 

GCGCACAACG AAGCCGCCCT GCGCGACCAG 26160 

CACCCCGAGC TGCACCCGGC CGACGTCGGC 26220 

GAGCAGCGGG CCGCCGTCGT CGGCGAGAAC 26280 

CTCGTCGAAG GCCGACCGCA CCCGCTCGTG 26340 

GTCGTGTTCG TCTTCCCCGG CCAGGGCTCG 26400 

GCCCGCTCCA GCGGCTCCGG CTCCTTCCTG 26460 

CGGCCCCACC TCGGCTGGTC CGTCCTGGAC 26520- 

CTCGACCGGG TCGACGTGGT GCAGCCCGTG 26580 

ACGTGGCGTT CGCTGGGCGT CGAACCGGCC 26640 
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GCGGTCGTCG GTCACTCCCA GGGCGAGATC GCCGCCGCCT 


ACGTCGCCGG 


CGCCCTGACG 


z o / u u 




CTGGACGACG C GGCGCGC AT CGTCGCCCTG CGCAGCCAGG 


CGTGGCTGCG 


GCTGGCCGGC 


Z D / O U 


5 


AAGGGCGGCA TGGTCGCCGT GACCCTGTCC GAACGCGACC 


TGCGTCCCCG 


CCTGGAGCCC 


Z DO^U 




TGGAGCGACC GGCTCGCCGT CGCCGCCGTC AACGGCCCCG AGACCTGCGC 


CGTCTCCGWj 


•y coon 
Z boOU 


10 


GACCCGGACCj CUC ±\iOCk>OA VjU IvjO i v_L»i»^- sj^vrt.^- a s^vjtv* ± vj 


CGGAGGGCGT 


GCACGCCCGC 






CCCATCCCCG GCG TLCsAt-AL. LljLLWjyLAt rc*j*_v-.ijt_AVjv* 


TCGACACGCT 


GGAGGCCCAC 


0*7 AAA 

Z / uuo 




/^rrv^pnp^Am n\^r"nrpp/vp r > r ,r vf^nr , n.cc > r' crzc&CC'VCCCi 


ACATCCCGTT 


All-mi rTV/~* /~1 X f**r> 

CTACTCGACG 


•? A £ A 

z / UoU 


15 


L» 1 C AC C WjAo O AV- 1\?A 1 Uv?A L.AV-wij^-i-wtv» \- J- vTuv-k.\»\jv-.\-^ 


ACTACTGGTA 


CCGCAACATG 


Z / lzU 




r*rrT*'hr y nr*nr* r vdr , 7^f >T V r VCCl'L n.C2kCSC\CClKCC CGCGCCCTGA 


TCGCCGACGG 


CCACGACGTG 


0*7 1 Q A 




Tfrriw^irzT rrcanrrcGrA CCCCATGCTG GCCGTCTCCC 


TCCAGGAGAC 


GATCAGCGAC 


Z /Z4U 


20 


GCCGGTTCCC CGGCGGCCGT CCTCGGCACC CTGCGGCGCG 


GCCAGGGCGG 


CCCCCGCTGG 


")7 "J A A 




CTGGGCGTCG CCCTCTGCCG CGCCTACACC CACGGCCTGG 


AGATCGACGC 


CGAGGCCATC 


Z / jOu 


25 


TTCGGCCCCG ACTCACGCCA GGTGGAACTG CCCACGTACC 


CCTTCCAGCG 


CGAGCGCTAC 


Z f *k£\J 


TGGTACAGCC CCGGCCACCG CGGTGACGAC CCCGCCTCCC 


TCGGTCTGGA 


CGCCGTCGAL 


Z / 4oU 




CACCCGCTGC TGGGCAGCGG CGTCGAACTG CCGGAGTCCG 


GTGACCGGAT 


GTACACCGCA 


Z r D4U 


30 


CGGCTGGGCG CCGACACCAC CCCGTGGCTG GCCGACCACG 


CGCTGCTGGG 


GTCGCCGCTG 


z / oyv 




CTGCCCGGCG CCGCCTTCGC CGACCTGGCG CTCTGGGCCG 


GCCGCCAGGC 


CGGCACCGGC 


Z r ODU 




CGCGTCGAGG AGCTCACCCT GGCCGCGCCC CTGGTGCTGC CCGGCTCCGG GGGTGTCCGG 


z / / zu 


35 


CTGCGGCTGA ACGTCGGCGC CCCGGGCACC GACGACGCCC 


GCCGCTTCGC 


CGTGCACGCC 


Z f / oU 




CGCGCCGAGG GCGCCACGGA CTGGACCCTG CACGCCGAGG 


GGCTGCTCAC 


CGCGCAGGAC 


0*7 Q/t A 

Z f o4U 




ACGGCCGACG CGCCGGACGC CTCGGCGGCC ACCCCGCCCC 


CCGGCGCCGA 


ACAACTGGAC 


z / yuu 


40 


ATCGGCGACT TCTACCAGCG CTTCTCCGAA CTCGGTTACG 


GCTACGGCCC 


GTTCTTCCGG 


Z / you 




GGACTGGTGA GCGCCCACCG CTGCGGCCCC GACATCCACG 


CGGAGGTCGC 


GCTGCCCGTC 


O Q AO A 
Z o UZ u 


45 


CAGGCGCAGG GCGACGCGGC CCGCTTCGGC ATCCATCCCG 


CGCTGCTGGA 


CGCGGCGCTG 


o q nan 


CAGACCATGA GCCTCGGGGG CTTCTTCCCC GAGGACGGCC GCGTCCGCAT GCCGTTCGCC 


OQ1 4A 
ZO ±4 v 




CTGCGCGGCG TTCGGCTGTA CCGCGCCGGA GCCGACCGGC 


TGCACGTGCG 


CGTCTCGCCC 


Z OZ w V 


50 


GTCTCCGAGG ACGCGGTCCG CATCAGGTGC GCCGACGGCG 


AGGGACGGCC 


GGTCGCCGAG 


28260 




ATCGAGTCCT TCATCATGCG GCCGGTCGAC CCGGGACAGC 


TCCTGGGCGG 


CCGCCCGGTC 


28320 




GGCGCCGACG CGCTCTTCCG CATCGCCTGG CGGGAACTCG 


CCGCCGGCCC 


GGGCACCCGT 


28380 


55 


ACCGGCGACG GCACCCCTCC CCCGGTGCGC TGGGTGCTGG 


CGGGACCCGA 


CGCGCTGGGC 


28440 
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CTGGCCGAGG CGGCCGACGC CCACCTGCCC GCCGTTCCCG GCCCGGACGG CGCACTGCCG 28500 

^ TCCCCGACGG GACGCCCGGC GCCGGACGCC GTCGTGTTCG CGGTCCGTCC CGGGACCGGC 28560 

GACGTCGCCG CCGACGCGCA CACCGTGGCC TGCCGGGTGC TGGACCTCGT CCAGCGCCGG 28620 

CTCGCGGCCC CGGAGGGCCC GGACGGCGCC CGCCTGGTGG TGGCCACCCG CGGCGCGGTC 28680 

10 GCCGTACGCG ACGACGCCGA GGTGGACGAC CCGGCCGCGG CCGCCGCGTG GGGCCTGCTG 28740 

CGCTCCGCGC AGGCCGAGGA GCCCGGCCGG TTCCTGCTCG TGGACCTGGA CGACGACCCG 28800 

GCGTCCGCCC GGGCGCTGAC CGACGCCCTC GCCTCCGGCG AACCGCAGAC CGCGGTCCGG 28860 

15 GCCGGGACGG TGTACGTGCC CCGGCTGGAG CGGGCCGCCG ACCGCACGGA CGGGCCGCTC 28920 

ACCCCGCCCG ACGACGGTGC CTGGCGGCTG GGCCGGGGCA CCGACCTCAC CCTCGACGGC 28980 

CTCGCCCTGG TGCCCGCCCC GGACGCCGAG GCGCCGCTGG AGCCCGGCCA GGTGCGCGTC 29040 

20 

GCCGTACGCG CCGCGGGCGT CAACTTCCGC GACGCCCTCA TCGCCCTCGG CATGTACCCG 29100 

GGCGAGGCGG AGATGGGAAC GGAGGGCGCC GGCACCGTCG TCGAGGTCGG CCCCGGCGTC 29160 

ACCGGTGTCG CCGTCGGCGA CCGCGTGCTC GGCCTGTGGG ACGGCGGCCT GGGCCCGCTG 29220 

25 

TGCGTGGCCG ACCACCGGCT GCTCGCCCCC GTCCCGGACG GCTGGTCCTA CGCCCAGGCC 29280 

GCCTCGGTCC CCGCGGTGTT CCTCAGCGCC TACTACGGTC TGGTCACCCT GGCCGGCCTC 29340 

30 kGGCCGGGGG AGCGGGTGCT CGTGCACGCC GCCGCCGGGG GCGTCGGCAT GGCCGCGGTG 29400 

CAGATCGCCC GCCACCTCGG CGCGGAGGTG CTGGCCACCG CGAGCCCCGG CAAGTGGGAC 29460 

GCCCTCCGCG CCATGGGCAT CACCGACGAC CACCTCGCCT CCTCCCGCAC CCTCGACTTC 29520 

35 GCGACCGCCT TCACCGGAGC GGACGGCACG TCCCGCGCGG ACGTCGTCCT GAACTCGCTC 29580 

ACCAAGGAGT TCGTGGACGC CTCCCTCGGG CTGCTCCGTC CGGGCGGCCG GTTCCTGGAG 29640 

CTGGGCAAGA CCGACGTCCG GGACCCCGAG CGGATCGCCG CCGAACACCC CGGGGTGCGC 29700 

40 

TACCGGGCGT TCGACCTCAA CGAGGCCGGA CCCGACGCAC TCGGCCGGCT GCTGCGGGAA 29760 

CTGATGGACC TGTTCGCCGC CGGCGTGCTG CACCCGCTGC CCGTCGTCAC CCACGACGTG 29820 

45 CGCCGGGCCG CGGACGCCCT GCGCACCATC AGCCAGGCCC GGCACACCGG AAAGCTCGTC 29880 

CTGACCATGC CGCCCGCCTG GCACCCGTAC GGCACGGTCC TGGTCACCGG TGGCACCGGC 29940 

GCCCTCGGCA GCCGCATCGC CCGCCACCTG GCGAGCCGGC ACGGCGTCCG CCGGCTGCTC 30000 

50 ATCGCCGCCC GCCGGGGCCC GGACGGCGAG GGCGCCGCGG AGCTGGTCGC CGACCTCGCC 30060 

GCCCTGGGCG CGTCGGCCAC CGTGGTCGCC TGCGACGTCT CCGACGCGGA CGCCGTCCGC 3 0120 

GGACTGCTCG CCGGCATACC GGCCGATCAC CCGCTGACGG CGGTGGTGCA CAGCACCGGC 30180 

55 

GTCCTCGACG ACGGCGTGCT GCCCGGGCTC ACCCCCGAGC GGATGCGGCG CGTGCTGCGG 30240 



126 



EP0 791 655 A2 



CCCAAGGTGG AGGCCGCCGT CCACCTGGAC GAACTCACCC GCGACCTCGA CCTGTCGGCG 30300 

TTCGTCCTCT TCTCCTCCAG CGCCGGTCTG CTGGGCAGCC CGGCCCAGGG CAACTACGCG 30360 

5 

GCGGCCAACG CCACCCTCGA CGCCCTCGCC GCCCGGCGCC GGTCCCTCGG CCTCCCGTCG 30420 

GTGTCACTCG CCTGGGGTCT GTGGTCCGAC ACCAGCCGGA TGGCACACGC ACTGGACCAG 3 0480 

10 GAGAGCCTCC AGCGGCGCTT CGCCCGCAGC GGCTTCCCGC CCCTGTCCGC CACGCTGGGC 3 0540 

GCCGCGCTGT TCGACGCCGC CCTGCGGGTC GACGAGGCCG TGCAGGTCCC CATGCGGTTC 3 0600 

GACCCGGCCG CGCTGCGCGC CACCGGAAGC GTCCCCGCCC TGCTGTCGGA CCTCGTCGGG 30660 

15 TCCGCCCCGG CGACCGGGTC CGCGGCCCCG GCGTCCGGCC CCCTTCCGGC TCCGGACGCC 30720 

GGGACCGTCG GCGAGCCGCT CGCCGAGCGG TTGGCCGGAC TCTCCGCCGA GGAACGCCAC 3 0780 

GACCGGCTGC TCGGCCTGGT CGGCGAACAC GTGGCCGCGG TACTGGGCCA CGGCTCCGCC 3 0840 

20 

GCCGAGGTCC GGCCCGACCG GCCGTTCCGC GAGGTCGGGT TCGACTCGCT CACGGCCGTG 30900 

GAACTGCGCA ACCGGATGGC GGCGGTCACC GGGGTCAGGC TCCCCGCCAC CCTGGTCTTC 30960 

GACCACCCCA CCCCCGCCGC GCTGTCCTCG CACCTCGACG GCCTGCTGGC CCCGGCACAG 31020 

25 

CCGGTCACCA CCACACCGCT GCTGTCCGAA CTGGACCGCA TCGAGGAGGC CCTGGCCGCC 31080 

CTCACCCCCG AGCACCTCGC GGAGCTCGCC CCCGCCCCCG ACGACCGGGC CGAGGTCGCC 31140 

30 CTGCGCCTGG ACGCCCTGGC CGACCGCTGG CGCGCCCTGC ACGACGGCGC GCCCGGCGCC 31200 

GACGACGACA TCACCGACGT GCTGAGCAGC GCCGACGACG ACGAGATCTT CGCGTTCATC 31260 

GACGAGCGGT ACGGCACGTC GTGACCGCCG GCCCGGAGCC CCGCCCGTCA TCGAAAGGAA 31320 

35 GCACCACCAT GGCGAACGAA GAGAAGCTGC GCGCCTACCT CAAGCGCGTG ACGGGTGAGC 31380 

TGCACCGGGC CACCGAGCAG CTGCGTGCCC TGGACCGGCG GGCCCACGAG CCGATCGCGA 31440 

TCGTCGGGGC GGCCTGCCGA CTCCCCGGCG GCGTCGAGAG TCCGGACGAC CTCTGGGAGC 31500 

40 

TGCTGCACGC CGGTGCCGAC GCGGTCGGCC CGGCCCCCGC CGACCGCGGC TGGGACGTGG 31560 

AGGGAAGGTA CTCGCCCGAC CCCGACACGC CCGGCACCTC GTACTGCCGC GAGGGCGGCT 31620 

TCGTGCAGGG GGCCGACCGG TTCGACCCCG CCCTCTTCGG CATCTCGCCC AACGAGGCGC 31680 

45 

TCACCATGGA CCCCCAGCAG CGGCTGCTGC TGGAGACCTC CTGGGAGGCG CTGGAGCGAG 31740 

CCGGTCTGGA CCCCCAGTCC CTGGCGGGCA GCCGGACCGG CGTGTTCGCC GGGGCGTGGG 31800 

50 AGAGCGGCTA CCAGAAGGGC GTCGAAGGGC TCGAAGCCGA TCTGGAGGCC CAACTCCTGG 31860 

CCGGCATCGT CAGCTTCACC GCCGGCCGCG TCGCCTACGC CCTGGGCCTG GAGGGCCCGG 31920 

CGCTGACGAT CGACACGGCC TGCTCCTCGT CGCTGGTGGC ACTGCACCTG GCGGTGCAGT 31980 

CACTCCGCCG GGGCGAGTGC GACCTCGCAC TGGCGGGCGG CGCCACGGTC ATCGCCGACT 32040 
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TCGCGCTCTT CACCCAGTTC TCCCGGCAGC 
CCTTCGGTGA GACGGCCGAC GGCTTCGGCC 

5 

AGCGGCTGTC GGACGCCCGC CGCAACGGGC 
CCGTCAACCA GGACGGTGCG AGCAATGGGC 

10 GGGTGATCGG TGAGGCGCTG GCCGACGCGG 

AGGCGCACGG CACCGGCACG CCGCTCGGCG 
CGTACGGGCA CGAACGGACG GGCGACCCGC 

75 GGCACACCCA GGCCGCCGCC GGCGTGGCCG 

ACGGTGAGCT GCCGCGCACC CTGCACGCGT 
CGGGCGCCGT GGAGTTGCTG GACGAGGCCA 

20 

GCCGGGCGGG CATCTCCTCG TTCGGCATCA 
AGGAGCCGCC CGCCCGGCCG GAGCCCGAGG 
CCGTCCTCCC GCTGTCGGCC GCCGGCGCGC 

25 

CCGCGCACCT GGCCGGCCAC GAGGAGATCA 
CCACCCGTGC CGCGCTCTCG CACCGGGCCT 

30 TCGACAGGCT GACCGCGCTG GCGGAGGACA 

CGGGCAGCGG CCGGCCCCCC GTCTTCGTCT 
TGGGCGCCGA ACTCCTGGAC AGGGCACCGG 

55 GGGCCCTCGC GGCCCACCTC GACTGGTCGG 

CCCCGCCGAT CGACCGCGCG GACGTCGTCC 
TCGCGGCGCT GTGGGAGTCC CACGGTGTAC 

40 

GCGAGATCGC CGCCGCCCAC GCGGCCGGTG 
TCGCCGAGCG CAGCAGGCTC TGGAAGCGGC 

4$ TGGCCCCGGC CGACCGGGTC CGCGAACTGA 

CCGCCGTCAA CGGCCCCGCC TCGGTCACCG 
TCGGCGGCCG GCTCTCCGCC GCCGGGGTGC 

SO CCGGACACTC ACCCCAGGTG GAGCAGTTCC 

TCCGCCCGAC CGCCGCCCGG CTGCCCTTCT 
CCGAAGGCCT GGACGCCGCG TACTCGTACC 

55 

CCACCCTGCG GACGCTGCTG CGCGAGGGCC 



GCGGGCTCGC CCCCGACGGG CGGTGCAAGG 32100 

CCGCCGAGGG CGCGGGGATG CTGCTGGTCG 32160 

ACCCGGTGCT GGCGGTGGTG CGGGGCAGTG 32220 

TGACGGCGCC GAGTGGTCCT GCGCAGCAGC 32280 

GGCTGACGCC CGCCGACGTG GACGCGGTCG 32340 

ACCCCATCGA GGCCGGCGCG CTCATGGCGA 32400 

TGTGGCTGGG TTCGCTGAAG TCGAACATCG 32460 

GGGTGATCAA GATGGTGCTG GCGCTGCGCC 32520 

CGACGGCGTC CTCCAGGATC GAATGGGACG 32580 

GGCCCTGGCC CCGGCGTGCC GAGGGGCCGC 32640 

GCGGCACCAA CGCGCACCTC GTCATCGAGG 32700 

AGGCCGCGCA GCCGCCCGCC CCGGCCACCA 32760 

GATCCCTGCG CGAGCAGGCC CGCAGGCTCG 32820 

CCGCCGCCGA CGCCGCCCGC TCCGCCGCCA 32880 

CGGTCCTGGC CGACGACCGG CGGGCGCTGA 32940 

GGAAGGACCC CGGCGTCACC GTCGGCGAGG 33 000 

TCCCGGGACA GGGCTCCCAG TGGACGGGCA 33 060 

TCTTCCGCGC CAAGGCCGAG GAGTGCGCGC 33120 

TGCTCGACGT CCTGCGCGAC GCGCCCGGCG 33180 

AGCCGACCCT GTTCACCATG ATGGTCTCCC 33240 

GGCCCGCCGC CGTGGTCGGC CACTCCCAAG 33300 

CCCTGTCCCT CGACGACGCG GCCCGCGTGA 33360 

TGGCCGGAAA CGGCGGCATG CTCTCCGTGA 33420 

TGGAGCCCTG GGCGGAGCGG ATGTCCGTGG 33480 

TGGCCGGTGA CGCGCGGGCG CTGGAGGAGT 33 540 

TGCGCTGGCC CCTCGCCGGC GTCGACTTCG 33 600 

GCGCCGAGCT CCTCGACACG CTGGGCACCG 33660 

TCTCCACCGT GACCGCCGCG GCGCACGAGC 33720 

GGAACATGCG CGAACCCGTG GAGTTCGCGT 33780 

ACCGCACCTT CGTCGAGATG GGCCCGCACC 33840 
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CCCTGCTGGG CGCCGCGATC GACGAGGTCG 
TCGCCACCCT CCACCGCGGC TCCGGCGGCC 

5 

CGTTCGCCCA CGGAGTACGG GTCGACTGGG 
GGGTCCCGCT GCCCACCTAC GCCTTCAGCC 

1Q GCCGGCGCGC CGTCGAGGCG GCCCCCGTCG 

CCTGGACACC CGTGGCATCC GACGACTCCG 
AGACCCCCGG CACCGCGCCG GACGAGGCGG 

75 GGGTGGTCGT GGAGCGCTGC CTGCTGGATC 

AGCGACTGGC CGAACTGGAC GCGCAGCCGG 
GCCGTCCGCA GAGCACCGCA CCGGCCGACG 

2J TCCTGCTCGT GGTCCAGGCC GTGCCGGACG 

CGCGGGGTGC GGTGGCGGTG GGGTCGGGTG 
GGGGTCTGGG GCGGGTGGCT GCGTTGGAGG 

25 

TGGCGGTGGG GGCGGGTGTG CGTGAGTGGC 
GTGAGGATCA GGTGGCGGTG CGTGGTGGGG 

30 GGGTGCGGGG TGGTTCGGGG GTGTGGCGTG 

TGGGTGGTGT GGGGGGTCAT GTGGCGCGGT 
TGTTGGCGGG GCGTCGGGGT GGTGGGGTTG 

35 TGGGGTTGGG GGCGAAGGTG ACGTTCGTTT 

TGGGGTTGTT GGGTGTGGTG GAGGGGTTGG 
CGGGGGTGGC TCAGGTGTCG GGGTTGGGTG 

40 

TGGGGGGTAA GGCGGTGGGG GCTGAGTTGT 
ATGCGTTCGT GTTGTTCTCG TCGGGTGCTG 
ATGCGGCGGC CAATGCGCAT CTGGATGCGT 

45 

CCGCGACCTC CGTCGCCTGG GGCCTGTGGG 
TCACGGAGTT CTACGCCGAG CGCGGCCTCG 

50 CACTGCACAC GGCACTGAAC GAGGGCGACA 

AACACTTCGT CACCGGGTTC ACCGCCTACC 
AGGTCCGCGC GTTGCGCACG CCCGAACCCA 

55 GCGTCGACGC CGCCCTCACC CCGCGCGAGC 



CCGAGGCCGA GGGCGTGCAC GCCACCGCCC 33900 

TGGACCGGTT CCGCTCCTCG GTGGGCGCCG 33960 

ACGCCCTCTT CGAGGGCTCC GGCGCCCGCC 34020 

GGGACCGGTA CTGGCTGCCC ACCGCCATCG 34080 

ACGCGTCCGC CCCCGGGCGC TACCGCGTCA 34140 

GCCGGCCCTC CGGGCGCTGG CTGCTGGTGC 34200 

ACACCGCGGC GTCGGCCCTC GGTGCGGCCG 34260 

CCACCGAGGC CGCGCGCGTC ACGCTCACCG 34320 

AGGGCCTGGC CGGCGTGCTG GTGCTGCCCG 34380 

CCTCCCCGCT CGACCCGGGG ACGGCCGCCG 3 4440 

CCGCTCCGAA GGCCCGGATC TGGGTGGTGA 34500 

AGGTGCCGTG TGCGGTGGGT GCGCGGGTGT 34560 

TGCCGGTGCA GTGGGGTGGG TTGGTGGATG 34620 

GTCGTGTGGT GGGTGTGGTT GCGGGGGGTG 34680 

GTGTGTTCGG TCGTCGTCTG GTGGGTGTGG 34740 

CGCGGGGGTG TGTGGTGGTG ACGGGTGGGT 34800 

GGTTGGCGCG TTCGGGTGCG GAGCATGTGG 34860 

TGGGGGCGGT GGAGTTGGAG CGGGAGTTGG 34920 

CGTGTGATGT GGGGGATCGG GCGTCGATGG 34980 

GGGTGCCGTT GCGTGGTGTG TTTCATGCGG 35040 

AGGTGTCGTT GGCGGAGGCG GGTGGTGTGT 35100 

TGGACGAGTT GACGGCGGGT GTGGAGCTGG 3 5160 

GGGTGTGGGG GAGTGGGGGG CAGTCGGTGT 35220 

TGGCGGAGCG TCGTCGTGCG CAGGGGCGTC 35280 

GCGGCGAGGG CATGGGAGCG GACGAAGGCG 35340 

CCCCCATGCG GCCCGAGTCG GGCATCGAGG 35400 

CCTGCGTCAC GGTCGCCGAC ATCGACTGGG 35460 

GGCCCAGCCC GCTGATCTCC GACATCCCCC 35520 

CCGTGGACGC CTCGGACGGA CTGCGCCGGC 35580 

GCACCAAGGT CCTGGTCGAC CTGGTCCGCA 35640 
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CGGTGGCGGC GGAGGTCCTC GGTCACGACG 
5 TCCGGGACCT CGGCTTCGAC TCGCTGGCCG 

CGACCGGACT CGTACTGCCC GCGACGGTCA 
GCGGCGCGCT GCTGGAGCGG CTGTCCGCGG 
10 CCGCCGGGGG GAGGCCCGCG ACCCCACCGC 

ACATCGACGA ACTGGACGCG GACGCCCTGA 
CCGACGGCAC GCCGGCCGAC GGCGGGCCCG 
75 CGGAGCAGTA GCGCGCCCTC ACCGGCGCGC 

CCCCACAGCC AGCGAGCAGA CGAGGAAGCC 
GGGTGCGCTG CGCACCTCCG TCAAGGAGAC 

20 

CCTGGCCGGC GCGCACGAGC CGGTCGCCAT 
CGTGAGCACC CCGGACGACC TGTGGGAGCT 
CTTCCCGGCC GACCGGGGCT GGGACGAGGA 

25 

CGGCACCACC TACTGCCGTG AGGGCGGCTT 
CTTCTTCGGC ATCTCGCCGA ACGAGGCGCT 
30 GGAGACGTCG TGGGAGACGT TGGAGCGGGC 

CCGTACCGGT GTCTTCGTCG GAGCCGCGCA 
GCCCGAGGGC ACCGAGGGCT ATCTGCTGAC 
GATCGCCTAC TCCCTGGGTC TGGAGGGGCC 
GTCGTTGGTG GCGTTGCATC TGGCGGTGCA 
GTTGGCCGGC GGCGTCGCGG TCATGCCCGA 

40 

GCGGGGGCTG GCGGTGGACG GGCGGTGCAA 
GTGGGCGGAG GGAGTGGGTG TGCTGCTGGT 

45 CCATCGGGTG CTGGCGGTGG TGCGGGGCAG 

GCTGACGGCG CCGAGTGGTC CTGCGCAGCA 
GGGGCTGACG CCCGCCGACG TGGATGTGGT 

50 TGATCCGATC GAGGCGGGTG CGTTGCTGGC 

TTTGTGGTTG GGGTCGTTGA AGTCGAACAT 
TGGTGTGATC AAGGTGGTGC AGGCGATGCG 

55 

GGATGCGCCG TCGTCGAAGG TGGAGTGGGC 



GGATCGGCGG CATCGGCCAC GACGTGGCCT 35700 

CGGTCCGGAT GCGCGGCCGG CTGGCCGAGG 35760 

TCTTCGACCA CCCCACCGTG GACCGGCTCG 35820 

ACGAACCCGC GCCCGGCGGG GCGCCGGAGC 35880 

CCGCACCGGA GCCGGCCGTC CACGACGCCG 35940 

TCCGGCTGGC CACGGGAACC GCCGGACCGG 36000 

ACGCGGCGGC GACCGCCCCC GACGGAGCAC 36060 

CGACCGGCGG AGCGCCGTAC CGCCGACGCC 36120 

GAAGATGTCA CCGTCCATGG ACGAAGTGCT 36180 

CGAGCGGCTG CGCCGGCACA ACCGGGAGCT 36240 

CGTGGGCATG GCCTGCCGCT ACCCCGGTGG 36300 

CGCCGCGGAC GGCGTCGACG CGATCACCCC 36360 

CGCCGTCTAC TCGCCCGACC CCGACACCCC 36420 

CCTCACCGGC GCCGGGGACT TCGACGCGGC 36480 

GGTGATGGAC CCGCAGCAGC GGCTGTTGCT 36540 

CGGCATCGTC CCCGCGTCGC TGCGCGGCAG 36600 

CACGGGATAC GTCACCGACA CCGCGCGAGC 3 6660 

GGGCAACGCC GATGCCGTCA TGTCCGGCCG 3 6720 

GGCGCTGACG ATCGGGACGG CCTGCTCGTC 3 6780 

GTCGTTGCGG CGGGGCGAGT GCGACCTGGC 36840 

CCCGACGGTG TTCGTGGAGT TCTCGCGGCA 3 6900 

GGCGTTCGCG GAGGGTGCGG ACGGGACGGC 36960 

GGAGCGGCTT TCCGACGCGC GCCGCAATGG 37020 

TGCGGTCAAT CAGGACGGGG CGAGCAATGG 37080 

GCGGG TGATC CGTGAGGCGC TGGCTGATGC 37140 

GGAGGCGCAC GGTACGGGGA CGGCGTTGGG 37200 

CACGTACGGG CGGGAGCGGG TCGGTGATCC 37260 

CGGGCATGCG CAGGCGGCTG CGGGTGTGGG 37320 

GCATGGGTCG TTGCCGCGGA CGCTGCATGT 37380 

TTCGGGTGCG GTGGAGCTGC TGACCGAGGG 37440 
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CCGGTCGTGG 


CCGCGGCGGG TGGAGCGGGT GCGGCGGGCC 


GCGGTGTCGG CGTTCGGGGT 


37500 


GAGCGGGACC 


AACGCCC A 1\j 




pp a agp Ar pp. 


GTCGAGGCCG GGAGCGAGCA 


3756U 


CGGGGACGGC 


CCCGGACCCG 




rp rr'p.TP & p P. 


GGTCCGCTCC CCTGGGTGCT 


37620 


CTCGGCACGC 








CGACTCGCCG CTCTCGCCCG 


3 / boU 


CCAGGGGCGC 


ACGGAGGGCA 






GTCGTCCCCG CGGCCGACAT 


^ "7*7 >♦ (\ 

3 / /40 


CGGATACTCC 


CTGGCCACCA 


C C AtaGOAO AU 




CGGGCGGTGG CGCTGGTGCA 


1 T Q A 

37800 


GGAGAACCGG 


ACGGCCGGGG 


AGGACU iXbC 




GCCGGCCGCA CACCGGAGAG 


i n o ^" a 

37860 


CGTGGTCACG 


GGTGTCGCGC 


GACG J/WjL. 




TTCCTCTGCT CGGGGChGGG 


37920 


CGCCCAGCGG 


CTCGGCGCCG 


GTCGGvjACjL. I 




TTCCCCGTCT TCGCCGACGC 


37980 


CCTCGACGAG 


ATCGCGGCGG 


AC» 1 IV-OAVwVjrU 


r 1 P 7i PP TP P. A A 


CGCCCTCTCC TGTCGGTGAT 


38040 


GTTCGCCGAG 


CCCGCCACCjL 




APTPPTPPAP 


CGCACCGACT ACACCCAGCC 


38100 


GGCCCTCTTC 


bLbb ITJUAUA 




PPPP.PTPPTG 


GAGAGCTGGG GCCTGGTCCC 


38160 


GGACGTCCTC 




pp ATpf^rcpra*; 




GCTCACGTGG CGGGCGTCTT 


38220 


CTCTGCGGCC 


L>AV_-VjUVjrL»U\_.L. 


LiVjV- IvAj IV. IV- 


PP.P APGCGGC 


CGGCTCATGC GGGCCCTGCC 


3 B28U 


CGAGGGCGGC 




CCGTGCAGGC 


CACCGAGCGG 


GAGGCCGCCG CGCTGGAGCC 


38340 


CGTCGCCGCC 


GGCGGCGCGG 


TGGTCGCCGC 


GGTCAACGGC 


CCGCAGGCCC TCGTGCTCTC 


38400 


CGGGGACGAG 


GCGGCCGTAC 


TGGCGGCGGC 


CGGTGAACTG 


GCCGCCCGCG GACGCCGCAC 


38460 


CAAGCGCCTG 


AGGGTGAGCC 


ACGCCTTCCA 


CTCACCCCGT 


ATGGACGCCA TGCTCGCCGA 


38520 


CTTCCGCGCG 


GTGGCGGACA 


CGGTCGACTA CCACGCCCCC 


CGGCTGCCGG TCGTCTCCGA 


*i O C O A 

38580 


AGTGACCGGC 


GACCTCGCCG ACGCCGCCCA GCTGACCGAC 


CCCGGCTACT GGACCCGCCA 


^ A C A A 

38640 


GGTGCGGCAG 


CCGGTGCGCT 


TCGCCGACGC 


CGTGCGCACC 


GCGAGCGCCC GGGACGCCGC 


1 A*7 A A 

38700 


GACCTTCATC 


GAGCTCGGGC 


CCGACGCCGT 


CCTGTGCGGC 


ATGGCGGAGG AGTCCCTGGC 


"i on c a 


CGCGGAGGCC 


GACGTCGTGT 


TCGCCCCGGC 


ACTGCGCCGC 


GGGCGCCCGG AGGGCGACAC 


1 Q Q O A 


CGTGCTCCGG 


GCCGCCGCGA GCGCGTACGT CCGCGGCGGG 


GGCCTCGACT GGGCCGCGCT 


1 OQ 0 A 

3oooU 


CTACGGCGGC 


ACGGGAGCCC 


GCCGCACCGA 


CCTGCCCACC 


TACGCCTTCC AGCACAGCCG 


^ A A J A 

38940 


CTACTGGCTC 


GCCCCCGCCT CGGCCGCGGT CGCCCCCGCG 


ACGGCCGCCC CCTCCGTCCG 


39000 


ATCCGTGCCG 


GAAGCCGAGC 


AGGACGGGGC 


GCTGTGGGCC 


GCCGTGCACG CCGGTGACGT 


39060 


CGCCTCGGCC 


GCGGCGCGAC 


TGGGCGCCGA 


CGACGCCGGT 


ATCGAACACG AACTGCGCGC 


39120 


GGTCCTGCCG 


CACCTGGCCG 


CCTGGCACGA CCGCGACCGC 


GCGACCGCGC GGACCGCGGG 


39180 


CCTGCACTAC 


CGCGTCACCT GGCAGGCGAT CGAGGCAGAC 


GCTGTCAGGT TCAGCCCCTC 


39240 
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GGATCGCTGG 

GGCGCTGCGC 

CGGATCACCG 

CGCACGGAGC 

GGTCGCCGGG 

ACTGACCGAC 

GGGGTCGGGT 

TGCGTTGGAG 

GCGTGAGTGG 

GCGTGGTGGG 

GGTGTGGCGT 

TGTGGCGCGG 

TGGTGGGGTT 

GACGTTCGTT 

GGAGGGGTTG 

GGGGTTGGGT 

GGCTGAGTTG 

GTCGGGTGCT 

TCTGGATGCG 

GGGCCCGTGG 

CGGCGTGGCC 

CGGCGAAGCC 

CGCCTTCCGT 

CGGCCGGCCC 

CCGGCCGCGG 

CGAGGACGCC 

CGCGGTGCGG 

CGTCTTCGAC 

GGGGACCCCC 

CCGCGCCGCC 



CTGATGGTCG AGCATGGGCA 
GCGGCCGGCG CGGAGGTCAC 
CGGACGGAGA CCCCGGACCG 
CCGGAGGGCC TGGCCGGCGT 
CACCCCGGGC TGGACCAGGG 
GCCGCGGTGC GGGCACCGCT 
GAGGTGCCGT GTGCGGTGGG 
GTGCCGGTGC AGTGGGGTGG 
CGTCGTGTGG TGGGTGTGGT 
GGTGTGTTCG GTCGTCGTCT 
GCGCGGGGGT GTGTGGTGGT 
TGGTTGGCGC GTTCGGGTGC 
GTGGGGGCGG TGGAGTTGGA 
TCGTGTGATG TGGGGGATCG 
GGGGTGCCGT TGCGTGGTGT 
GAGGTGTCGT TGGCGGAGGC 
TTGGACGAGT TGACGGCGGG 
GGGGTGTGGG GGAGTGGGGG 
TTGGCGGAGC GTCGTCGTGC 
GACGGCGACG GCATGGGCGA 
CCGCTCCACC CCGAGACGGC 
ACGGTCACCG TGGCGGACAT 
CCCAGCCCCC TGATCGCCGG 
GCCGAGGACA CCCCCACCGC 
CTCGCCCTGG ACCTGGTGCT 
CGGGTCGACG CCCGGGCCCC 
CTGCGCCGCC GGCTGGCCGA 
CACGAGGACC CCACCGCGCT 
GGCCCCCAGG AGGGCACGGC 
GTCGAACAGC GCAGGTCGAG 



GCACACGGAA 
CCGCCTGGTG 
CGGCACCCTG 
GCTGCTGCTC 
AACGGCGGCG 
GTGGGTGGTG 
TGCGCGGGTG 
GTTGGTGGAT 
TGCGGGGGGT 
GGTGGGTGTG 
GACGGGTGGG 
GGAGCATGTG 
GCGGGAGTTG 
GGCGTCGGTG 
GTTTCATGCG 
GGGTGGTGTG 
TGTGGAGCTG 
GCAGTCGGTG 
GCAGGGGCGT 
GATGGCGCCC 
GCTCACCGCC 
CGACTGGGAA 
CATCCCCGCG 
CCCCGGCCTC 
CCGCCACGTC 
CTTCCGGGAC 
GGACACCGGG 
GGCCCACCAC 
TCGGGCCGAG 
CGAGGTCGTG 



TGCGCGGACG 

TGGCCGCTGG 

GCGGCCCGGC 

CCCGACTCGG 

GTGCTGCTGA 

ACGCGGGGTG 

TGGGGTCTGG 

GTGGCGGTGG 

GGTGAGGATC 

GGGGTGCGGG 

TTGGGTGGTG 

GTGTTGGCGG 

GTGGGGTTGG 

GTGGGGTTGT 

GCGGGGGTGG 

TTGGGGGGTA 

GATGCGTTCG 

TATGCGGCGG 

CCCGCGACCT 

GAGGGCTACT 

CTGCACCAGG 

CGGTTCGCCC 

GCCCGTACGG 

CTGCGGGCGC 

GCGGCGGTCC 

CTCGGCTTCG 

CTCGACCTGC 

CTGGCCGGCC 

AGCGGGCTGT 

GAGCTGATGG 



CCGCGGAACG 

AGCAGCACAC 

TGGCCGAGCT 

GCGGTGCCGC 

CGATCCAGGC 

CGGTGGCGGT 

GGCGGGTGGC 

GGGCGGGTGT 

AGGTGGCGGT 

GTGGTTCGGG 

TGGGGGGTCA 

GGCGTCGGGG 

GGGCGAAGGT 

TGGGTGTGGT 

CTCAGGTGTC 

AGGCGGTGGG 

TGTTGTTCTC 

CCAATGCGCA 

CCGTCGCCTG 

TCGCCCGCCA 

CCATCGACGG 

CCGGCTTCAC 

CGCCCGCCGC 

GGCCCGAGGA 

TCGGCCACTC 

ACTCGCTCGC 

CCGGCACCCT 

TCGCCGACGC 

TCGCCTCCTT 

CCGACCTGGC 



39300 

39360 

39420 

39480 

39540 

39600 

39660 

39720 

39780 

39840 

39900 

39960 

40020 

40080 

40140 

40200 

40260 

40320 

40380 

40440 

40500 

40560 

40620 

40680 

40740 

40800 

40860 

40920 

40980 

41040 
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GGCGTTCCGG CCCGCCTACT CCCGGCAGCA CCCCGGCTCC GGCCGCCCCG CGCCCGTACC 41100 

CCTCGCGACC GGACCGGCGA CGCGTCCCAC GCTGTACTGC TGCGCCGGCA CCGCGGTCGG 41160 

5 

CTCCGGGCCC GCCGAGTACG TCCCGTTCGC CGAAGGACTG CGCGGCGTCC GGGAGACGGT 41220 

CGCCCTTCCC CTGTCCGGCT TCGGCGACCC CGCGG AACCG ATGCCCGCAT CGCTCGACGC 41280 

w GCTGATCGAG GTCCAGGCCG ACGTCCTCCT GGAGCACACC GCGGGCAAGC CCTTCGCCCT 41340 

CGCCGGCCAC TCCGCCGGCG CGAACATCGC CCACGCCCTG GCCGCCCGGC TGGAGGAACG 41400 

CGGCTCGGGC CCCGCAGCCG TCGTACTGAT GGACGTCTAC CGTCCCGAGG ACCCCGGTGC 41460 

15 GATGGGCGAG TGGCGCGACG ACCTGCTCAG CTGGGCGCTC GAACGCAGCA CGGTGCCCCT 41520 

GGAGGACCAC CGGCTCACCG CCATGGCCGG CTATCAGCGG CTGGTGCTCG GAACCCGGCT 41580 

CACCGCCCTC GAAGCCCCCG TCCTGCTGGC CCGGGCGTCC GAACCCCTGT GCGCGTGGCC 41640 

20 

GCCCGCGGGC GGGGCGCGGG GCGACTGGCG GTCCCAGGTC CCGTTCGCAC GGACCGTCGC 41700 

CGACGTGCCC GGCAACCACT TCACCATGCT CACCGAACAC GCCCGGCACA CCGCGTCCCT 41760 

GGTGCACGAA TGGCTGGACA GCCTCCCGCA CCAGCCCGGT CCCGCCCCGC TCACCGGAGG 41820 

25 

GAAACACTGA TGTACGCCGA CGACATCGCG GCCGTCTACG ACCTGGTCCA CGAGGGGAAG 41880 

GGGAAGGACT ACCGGCAGGA GGCCGAGGAG ATCGCCGCAC TCGTGCGCGT CCACCGGCCG 41940 

30 GGCGCCCGGA CCCTGCTCGA CGTGGCCTGC GGCACCGGCC AGCACCTGCA CCACCTGGAC 42000 

GGCCTCTTCG ACCACGTCGA GGGCCTGGAA CTCTCCGCCG ACATGCTGGC CCTCGCGACC 42060 

GGCCGGAACC CCGGTGTCAC CTTCCACCAA GGGGACATGC GCTCGTTCTC CCTGGGACGC 42120 

35 CGGTTCGACG CGGTGACCTG CATGTTCAGC TCCATAGGCC ACCTGCGGAC CACCGACGAA 42180 

CTCGACAGCA CGCTGCGGGC CTTCACCGAC CACCTCGAAC CGTCCGGCGT CATCGTCGTC 42240 

GAACCCTGGT GGTTCCCCGA GTCCTTCACC CCCGGTTACG TCGGCGCCAG CATCACGGAG 42300 

40 

GCGGGCGAGC GCACCGTCTG CCGGGTCTCG CACTCCGTAC GGGAGGGGAA CGCCACCCGC 42360 

ATCGAGGTGC ACTACCTCCT CGCCGGACCC GGCGGCGTCC GTCACCTGAC CGAGGACCAC 42420 

ACCATCACCC TGTTCCCGCG CGCCGACTAC GAGGCGGCCT TCGAGCGCQC CGGCTGCGAC 42480 

45 

GTGGTCTACC AGGAAGGCGG CCCGTCCGGT CGCGGGCTGT TCATCGGCAC CCGCCGCTGA 42540 

CCCGGTGCCG ACGCGGACCG CCGCGGCCCG GAGGCGGGTT GCCCCGACCC ACCCGGCACA 42600 

50 CCCGGGTCCC CCGATCGTGC GAGCGCCCCC ATCGACCCGA GAAGAAAGGC AGGGCAGCCA 42660 

TGCCCACCCT TGCCACGGAA ACGGCCCCCG CGAGCACGAG CACGAGCGCG GGCACGAGCA 42720 

CGGGCGTCCG TGCGCTCGGC CGTCGGCTCC AGCTGACCCG GGCCGCACAC TGGTGCGCCG 42780 

GCAACCAGGG CGACCCGTAC GCGCTGATCC TGCGCGCCGT CGCCGACCCC GAGCCGTTCG 42840 
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AACGGGAGAT CCGGGCCCGC GGACCGTGGT TCCGCAGCGA ACAGCTGGAC GCCTGGGTGA 42900 

CCGCGGACCC CGAGGTGGCG GCGGCCGTCC TGGCCGACCC GCGCTTCGGC ACGCTGGACC 42960 

GGGCCGGACG CCGCCCGGAC GAGGAACTGC TGCCCCTCGC CGAGGCGTTC CCCCACCACG 43020 

AACGCGCGGA GCTCGTACGC CTGCGGGCGC TGGCCGCCCC GGTGCTCAGC CGGTACGCCC 43080 

CGGCCCAGGC GCCCTGCGCG GCGCGCACCA CCGCCCGCAG AGTGCTCGGC CGCCTGCTGC 43140 

CCACCGGTGA CGCCGGGTTC GACCTTGTCG GCGAGGTCGC CCGGCCCTAC GCCGTCGAGC 43200 

TGATGCTCAG GCTCCTCGGA GTGCCGGGCC GCGACCGCGC CACCGCCGCG CGGGCACTCG 43260 

CCGCCTGCGG CCCCCAGCTC GACGCCCGGA TGGCCCCGCA ACTGCTGACC GTGGCCCGGG 43320 

AGTCCGCCGA CGCCGTCCGC ACACTGGCCG ACCTGGTCCC CGAGCTCGTC GCGGAGAAGT 43380 

CCCGGGGCCT CGGGAACGCC GAGCCCCGGC CCGACGACGT GCTCGCCCTC CTCCTGCACG 43440 

ACGGCGTCGC CCCCGGCGAC GTCGAGCGCA TCGCGCTGCT CCTCGCGGTC GGCGCACCCG 43500 

AACCCGTCGT CACCGCCGTC GCGCACACGG TCCACCGGCT GCTCGGCCGG CCGGGGGAGT 43560 

GGGAGAGGGC CCGCCGGACG CCGGCCGCGG CGAACGCCGT CGACCAGGTG CTGCGCGAGC 43620 

GCCCCCCGGC CCGGCTGGAG AACCGGGTCG CGCACACCGG CCTCGAACTC GGCGGCCGCC 43680 

GGATCACCGC CGACGAGCAC GTCGTGGTGC TGGCCGCCGC CGGACGGGAG ATCCCCGGGC 43740 

CGGAGCCGCT CGGGGGCGCC GACGGACCGC ACCTGGCGCT CGCCCTCCCG CTGATCCGCC 43800 

TGGCCGCCAC CACCGCGGTC CAGGTCACGG CCGGCCGCCT GCCCGGCCTG CGGGCCGAGG 43860 

GACCGCCCCT GACCCGGCCG CGGTCACCGG TCCTGGGCGC CTGCGCCCGC CTCCGGGTCC 43920 

ACCCGGGATG ACCCCGCCGT CCGTACGCCC CCTCCCAGAC CGGAGCCGCT GTGCGCGTCC 43980 

TGCTGACATC CCTCGCCCAC AACACCCACT ACTACAGTCT GGTGCCCCTC GCCTGGGCGC 44040 

TGCGCGCCGC CGGGCACGAG GTACGGGTGG CGAGCCCGCC CTCCCTCACC GACGTCATCA 44100 

CCTCCACCGG TCTGACCGCC GTACCGGTGG GCGACGACCG ACCGGCCGCG GAGCTGCTCG 44160 

CCGAGATGGG CAGAGACCTC GTCCCCTACC AGAGGGGCTT CGAGTTCGGT GAGGTGGAGA 44220 

3GCGAGGAGGA GACCACCTGG GAGTACCTGC TCGGCCAGCA GAGCATGATG GCCGCCCTGT 44280 

GCTTCGCCCC GTTCAACGGC GCCGCCACGA TGGACGAGAT CGTCGACTTC GCCCGTGGCT 44340 

GGCGGCCCGA CCTGGTCGTG TGGGAACCCT GGACCTA 44377 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4550 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ser Gly Glu Leu Ala lie Ser Arg Ser Asp Asp Arg Ser Asp Ala 
15 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Lys Leu Leu Thr Asp Gly Arg Asp Ala He Gly Arg 
35 40 45 

Asp Ala Asp Gly Arg Arg Arg Gly Met He Glu Ala Pro Gly Asp Phe 
50 ~ 55 60 

Asp Ala Ala Phe Phe Gly Met Ser Pro Arg Glu Ala Ala Glu Thr Asp 
65 70 75 80 

Pro Gin Gin Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 

Ala Gly He Val Pro Gly Ser Leu Arg Gly Glu Ala Val Gly Val Phe 
100 105 110 

Val Gly Ala Met His Asp Asp Tyr Ala Thr Leu Leu His Arg Ala Gly 
115 120 125 

Ala Pro Val Gly Pro His Thr Ala Thr Gly Leu Gin Arg Ala Met Leu 

130 135 140 

Ala Asn Arg Leu Ser Tyr Val Leu Gly Thr Arg Gly Pro Ser Leu Ala 

145 150 155 160 

Val Asp Thr Ala Gin Ser Ser Ser Leu Val Ala Val Ala Leu Ala Val 
165 170 175 

Glu Ser Leu Arg Ala Gly Thr Ser Arg Val Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu Val Leu Ala Asp Glu Gly Thr Ala Ala Met Glu Arg Leu Gly 
195 200 205 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 ^ 215 " 220 

Gly Tyr Val Arg Gly Glu Gly Gly Ala Ala Val Val Leu Lys Pro Leu 
225 230 235 240 

Ala Asp Ala Leu Ala Asp Gly Asp Pro Val Tyr Cys Val Val Arg Gly 
245 250 255 

Val Ala Val Gly Asn Asp Gly Gly Gly Pro Gly Leu Thr Ala Pro Asp 
260 265 270 

Arg Glu Gly Gin Glu Ala Val Leu Arg Ala Ala Cys Ala Gin Ala Arg 
275 280 285 

Val Asp Pro Ala Glu Val Arg Phe Val Glu Leu His Gly Thr Gly Thr 
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290 



295 



300 



Pro Val Gly Asp Pro Val Glu Ala His Ala Leu Gly Ala Val His Gly 
305 310 315 320 

Ser Gly Arg Pro Ala Asp Asp Pro Leu Leu Val Gly Ser Val Lys Thr 
325 330 335 

Asn He Gly His Leu Glu Gly Ala Ala Gly He Ala Gly Leu Val Lys 
340 345 350 

Ala Ala Leu Cys Leu Arg Glu Arg Thr Leu Pro Gly Ser Leu Asn Phe 
355 360 365 

Ala Thr Pro Ser Pro Ala He Pro Leu Asp Gin Leu Arg Leu Lys Val 
370 375 380 

Gin Thr Ala Ala Ala Glu Leu Pro Leu Ala Pro Gly Gly Ala Pro Leu 
385 390 395 * 400 

Leu Ala Gly Val Ser Ser Phe Gly He Gly Gly Thr Asn Cys His Val 
405 410 415 

Val Leu Glu His Leu Pro Ser Arg Pro Thr Pro Ala Val Ser Val Ala 
420 425 430 

Ala Ser Leu Pro Asp Val Pro Pro Leu Leu Leu Ser Ala Arg Ser Glu 
435 440 445 

Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg 
450 455 460 

Val Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg 
465 470 475 480 

Thr Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu 

485 490 495 

Leu Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly 
500 505 ~ 510 

Val Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr 
515 520 525 

Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly 
530 535 540 

Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly 
545 550 555 560 

Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp 
565 570 ~ 575 

Val Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly 
580 585 590 

Val Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu 
595 600 605 



Phe Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala 
610 615 620 
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Leu Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val 
625 630 635 640 



5 


Gly 


Glu 


Val 


Ala 


Ala 
645 


Ala 


Tyr 


Val 


Ala Gly Val 
650 


Leu 


Ser 


Leu 


Gly 

£ CC 
ODD 


Asp 




Ala 


Val 


Arg 


Leu 
660 


Val 


Val 


Ala 


Arg 


Gly Gly Leu 
665 


Met 


Gly 


Gly 
670 


Leu 


Pro 


10 


Val 


Gly 


Gly 
675 


Gly 


Met 


Trp 


Ser 


Val 
680 


Gly Ala Ser 


Glu 


Ser 
685 


Val 


Val 


Arg 


15 


Gly 


Val 
690 


Val 


Glu 


Gly 


Leu 


Gly Glu 
695 


Trp Val Ser 


Val 
700 


Ala 


Ala 


Val 


Asn 


Gly 
705 


Pro 


Arg 


Ser 


Val 


Val 
710 


Leu 


Ser 


Gly Asp Val 
715 


Gly 


Val 


Leu 


Glu 


Ser 
720 


20 


Val 


Val 


Ala 


Ser 


Leu 
725 


Met 


Gly Asp 


Gly Val Glu 
730 


Cys 


Arg 


Arg 


Leu 
735 


Asp 




Val 


Ser 


His 


Gly 
740 


Phe 


His 


Ser 


Val 


Leu Met Glu 
745 


Pro 


Val 


Leu 
750 


Gly 


Glu 


25 


Phe 


Arg 


Gly 
755 


Val 


Val 


Glu 


Ser 


Leu 
760 


Glu Phe Gly 


Arg 


Val 
765 


Arg 


Pro 


Gly 




Val 


Val 
770 


Val 


Val 


Ser 


Gly 


Val 
775 


Ser 


Gly Gly Val 


Val 
780 


Gly 


Ser 


Gly 


Glu 


30 


Leu 
785 


Gly 


Asp 


Pro 


Gly 


Tyr 

790 


Trp Val 


Arg His Ala 
795 


Arg 


Glu 


Ala 


Val 


Arg 
800 




Phe 


Ala 


Asp 


Gly 


Val 
805 


Gly 


Val 


Val 


Arg Gly Leu 
810 


Gly 


Val 


Gly 


Thr 
815 


Leu 


Jo 


Val 


Glu 


Val 


Gly 
820 


Pro 


His 


Gly Val 


Leu Thr Gly 
825 


Met 


Ala 


Gly 
830 


GlU 


Cys 


40 


Leu 
Arg 


Gly 

Ala 
850 


Ala 
835 
Glu 


Gly 
Arg 


Asp 
Glu 


Asp 
Val 


Val 

Phe 
855 


Val 
840 
Glu 


Val Val Pro 
Ala Ala Leu 


Ala 

Ala 
860 


Met 
845 
Thr 


Arg 
Val 


Arg 
Phe 


Gly 
Thr 




Arg 
865 


Asp 


Ala 


Gly 


Leu 


Asp 
870 


Ala 


Thr 


Ala Leu His 
875 


Thr 


Gly 


Ser 


Thr 


Gly 
880 


45 


Arg 


Arg 


He 


Asp 


Leu 


Pro 


Thr 


Tyr 


Pro Phe Gin 


Arg 


Arg 


Thr 


His 


Trp 



885 



890 



895 



50 



Ser Pro Ala Leu Ser Arg Pro Val Thr Ala Asp Ala Gly Ala Gly Val 
900 905 910 

Thr Ala Thr Asp Ala Val Gly His Ser Val Ser Pro Asp Pro Glu Ser 
915 920 925 



55 



Thr Glu Gly Thr Ser His Arg Asp Thr Asp Asp Glu Ala Asp Ser Ala 
930 935 940 
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Ser Pro Glu Pro Met Ser Pro Glu Asp Ala Val Arg Leu Val Arg Glu 
945 950 955 " 960 

Ser Thr Ala Ala Val Leu Gly His Asp Asp Pro Gly Glu Val Ala Leu 
965 970 975 

Asp Arg Thr Phe Thr Ser Gin Gly Met Asp Ser Val Thr Ala Val Glu 
980 985 990 

Leu Cys Asp Leu Leu Lys Gly Ala Ser Gly Leu Pro Leu Ala Ala Thr 
995 1000 1005 

Leu Val Tyr Asp Leu Pro Thr Pro Arg Ala Val Ala Glu His lie Val 
1010 1015 1020 

Glu Ala Ala Gly Gly Pro Lys Asp Ser Val Ala Gly Gly Pro Gly Val 
1025 1030 1035 " 1040 

Leu Ser Ser Ala Ala Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg 
1045 1050 1055 

Asp Asp Asp Asp Pro He Ala He Val Gly Val Gly Cys Arg Leu Pro 
1060 1065 1070 

Gly Gly Val Asp Ser Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly 
1075 1080 1085 

Ala Asp Ala lie Ser Ser Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp 
1090 1095 1100 

Gly Leu Tyr Asp Pro Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg 
H05 1110 ni5 H20 

Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 
1125 H30 H35 

Gly He Ser Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu 
1140 H45 H50 

Leu Leu Glu Ala Ser Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro 
1155 H60 H65 

Glu Ser Leu Arg Gly Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala 
1170 H75 H80 

Pro Glu Tyr Gly Pro Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly 
H*5 1190 1195 - " 1200 

Tyr Leu Leu Thr Gly Thr Thr Ala Ser Val Ala Ser Gly Arg He Ala 
1205 1210 1215 

Tyr Thr Leu Gly Thr Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg 
1235 1240 1245 

Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly 
1250 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp 
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1265 1270 1275 1280 

Gly Arg Cys Met Pro Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser 
1285 1290 1295 

Glu Gly Val Ala Val Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg 
1300 1305 1310 

. Ala Gly His Arg Val Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin 
1315 1320 1325 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu 
1330 1335 1340 

Gly Val lie Arg Ala Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp 
1345 1350 1355 1360 

Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro 
1365 1370 1375 



lie Glu Ala Ser Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly 
1380 1385 1390 

Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin 
1395 1400 1405 

Ala Ala Ala Gly Ala Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu 
1410 1415 1420 

His Gly Thr Leu Pro Arg Thr Leu His Ala Asp Arg Pro Ser Thr His 
1425 1430 1435 1440 

Val Asp Trp Ser Ser Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg 
1445 1450 1455 

Trp Pro Arg Arg Ser Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe 
1460 1465 1470 

Gly lie Ser Gly Thr Asn Ala His Leu He He Glu Glu Ala Pro Glu 
1475 1480 1485 

Trp Val Glu Asp He Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala 
1490 1495 1500 

Asp Ala Ala Ala Pro Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly 
1505 1510 1515 1520 

Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val 
1525 1530 1535 

Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr 

1540 1545 1550 

Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu 
1555 1560 ' " 1565 

Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val 
1570 1575 1580 

Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly 
1585 1590 1595 1600 
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Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly 
1605 1610 1615 

Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly Glu 
1620 1625 1630 

Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp Val 
1635 1640 * 1645 

Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val 
1650 1655 1660 

Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe 
1665 1670 1675 1680 

Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu 
1685 1690 1695 

Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly 
1700 1705 1710 

Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala 
1715 1720 1725 

Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val 
1730 1735 1740 

Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly 
1745 1750 1755 1760 

Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly 
1765 1770 1775 

Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val 
1780 1785 1790 

Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val 
1795 1800 1805 

Ser His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe 
1810 1815 1820 

Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val 
1825 1830 1835 " 1840 

Val Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu 
1845 1850 1855 

Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe 
I860 1865 1870 

Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val 
1875 1880 1885 

Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu 

1890 1895 1900 

Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg 

1905 1910 1915 1920 
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Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg 
1925 1930 1935 

Asp Ala Gly Leu Asp Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg 
1940 1945 1950 

Arg He Asp Leu Pro Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu 
1955 I960 1965 

Asp Pro Val Arg Thr Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro 
1970 1975 1980 

Ala Asp Ala Arg Ala Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly He 
1985 1990 1995 2000 

Arg Tyr Arg Val Ala Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro 
2005 2010 2015 

Gly Pro Ala Gly His Val Leu Leu Leu Ala Pro Asp Glu Asp Thr Ala 
2020 2025 2030 

Asp Ser Gly Leu Ala Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly 
2035 2040 2045 

Ala Glu Val His Thr Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala 
2050 2055 2060 

Ala Gly Asp Leu Leu Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr 
2065 2070 2075 2080 

Arg Val Leu Trp Leu Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val 
2085 2090 2095 

Ala Leu Val Gin Ala Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp 
2100 2105 2110 

He Thr Thr Arg Glu Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser 
2115 2120 2125 

Val Gly Gly Ala Gin Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu 
2130 2135 2140 

Leu Gly Arg Arg Trp Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser 
2145 ~ ~ & 2150 " 2155 2160 

Pro Ala Val Leu Arg Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu 
2165 2170 2175 

Asn Gin Phe Ala Val Arg Pro Ser Gly Val His Val Arg Arg Val Val 
2180 2185 2190 

Pro Ala Pro Val Pro Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala 
2195 2200 2205 

Pro Ala Thr Ala Val Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val 
2210 2215 2220 

Val Val Pro Asp Asp Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly 
2225 2230 2235 2240 
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Gly Thr Gly Ala Leu Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser 

2245 2250 2255 

Gly Ala Ala Arg Leu Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro 
2260 2265 2270 

Gly Val Gly Glu Leu Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val 
2275 2280 2285 

Ala Val Glu Ala Cys Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu 
2290 2295 2300 

Leu Ala Gly Leu Pro Glu Glu Arg Pro Leu Val Ala Val Leu His Ala 
2305 2310 2315 2320 

Ala Gly Val Leu Asp Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg 
2325 2330 2335 

Val Asp Ala Val Leu Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp 
2340 2345 2350 

Glu Leu Thr Ala Asp Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser 
2355 2360 2365 

He Val Gly Val Trp Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala 
2370 2375 2380 

Asn Ala Ala Leu Asp Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala 
2385 2390 2395 ~ 2400 

Arg Ala Ala Ser He Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala 
2405 2410 2415 

Ser Gly Thr Ala Ala Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu 
2420 2425 2430 

Asp Pro Glu Arg Ala Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly 
2435 2440 2445 

Gly Thr Ser Ala Ala Gly Thr His Ala Ala Gly Glu Ser Ser Leu Leu 
2450 2455 2460 

Val Ala Asp Val Asp Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg 
2465 2470 2475 *" 2480 

Arg Thr Trp Ser Leu Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg 
2485 2490 2495 

Ala Gly His Ala Ala Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg 
2500 2505 2510 

Pro Gly Asp Gly Ala Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly 
2515 2520 2525 

Arg Pro Trp Leu Ser Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala 
2530 2535 2540 ~ 

Leu Leu Thr Leu Val Arg Ser Glu Ala Ala Gly He Leu Arg His Ala 
2545 2550 2555 2560 

Ser Ala Asp Ala Val Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe 
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2565 2570 2575 

Asp Ser Leu Thr Val Leu Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr 
2580 2585 2590 

Gly Leu Asn Leu Pro Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu 

2595 2600 2605 

Ser Leu Ala Ser His Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu 
2610 2615 2620 

Ala Glu Pro Ala Ala Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg 
2625 2630 2635 2640 

Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val 
2645 2650 2655 

Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr 
2660 2665 2670 

Leu Ser Pro Phe Pro Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr 
2675 2680 2685 

Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly 
2690 2695 2700 

Phe Leu Arg Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser 
2705 2710 2715 2720 

Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu 
2725 2730 2735 

Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu 
2740 2745 2750 

Arg Gly Thr Arg Thr Gly Val Phe Ser Gly He Ser Gin Gin Asp Tyr 
2755 2760 2765 

Ala Thr Gin Leu Gly Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu 
2770 2775 2780 

Thr Gly Thr Leu Gly Ser Val He Ser Gly Arg Val Ala Tyr Ala Leu 
2785 2790 2795 2800 

Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser 
2805 2810 2815 

Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
2820 2825 2830 

Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Thr Val 
2835 2840 2845 

Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys 
2850 2855 2860 

Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val 
2865 2870 2875 2880 

Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
2885 2890 2895 
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Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
2900 2905 2910 

Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val lie 
2915 2920 2925 

Arg Glu Ala Leu Ala Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val 
2930 2935 2940 

Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala 
2945 2950 2955 2960 

Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu 
2965 2970 * 2975 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala 
2980 2985 2990 

Gly Val Gly Gly Val He Lys Val Val Gin Gly Met Arg His Gly Ser 
2995 3000 3005 

Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val Glu Trp 
3010 3015 3020 

Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg 
3025 3030 3035 3040 

Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser 
3045 3050 3055 

Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly 
3060 3065 3070 

Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr 
3075 3080 3085 

Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg 
3090 3095 3100 

Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp 
3105 3110 3115 3120 

Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly 
3125 3130 3135 

Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Lou Leu Ala Gly 
3140 3145 3150 

Leu Asp Val Val Ala Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala 
3155 3160 3165 

Gly Ala Val Val Arg Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val 
3170 3175 3180 

Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu 
3185 3190 3195 3200 

Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser 
3205 3210 ~ 3215 
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Val Val Gly Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala 
3220 3225 3230 

Asp Ala Asp Ser Val Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala 
3235 3240 3245 

Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu 
3250 3255 3260 

Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu 
3265 3270 3275 3280 

Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val 
3285 3290 3295 

Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly 

3300 3305 3310 

Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val 
3315 3320 3325 

Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro 
3330 3335 3340 

Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val 
3345 3350 3355 3360 

Val Thr Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser 
3365 3370 3375 

His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg 
3380 3385 3390 

Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val 
3395 3400 3405 

Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly 
3410 3415 3420 

Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala 
3425 3430 3435 3440 

Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu 
3445 3450 3455 

Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu 
3460 3465 3470 

Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg Pro 
3475 3480 3485 

Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp 
3490 3495 3500 

Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg 
3505 3510 3515 3520 

lie Asp Leu Pro Thr Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr 
3525 3530 3535 

Gly Ser Val Thr Gly Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly 
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3540 3545 3550 

Leu Glu Trp Lys Asp His Pro Phe Leu Ser Gly Ala Thr Pro lie Ala 
3555 3560 * 3565 

Gly Ser Gly Ala Leu Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His 
3570 3575 3580 

Pro Trp Leu Ala Asp His Ala lie Ser Gly Thr Val Leu Leu Pro Gly 
3585 3590 3595 3 600 

Thr Ala He Ala Asp Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala 
3605 3610 3615 

Gly Gly Val Glu Glu Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu 
3620 3625 3630 

Arg Gly Gly Leu His Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin 
3635 3640 3645 

Gly Arg Arg Ala Val Ala Val Ala Ala Arg Pro Glu Gly Pro Gly Arg 

3650 3655 3660 

Asp Gly Glu Glu Gin Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr 
3665 3670 3675 3680 

Ser Thr Glu Thr Ala Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp 
3685 3690 * * 3695 

Pro Pro Pro Gly Ala Glu Pro He Asp Val Glu Glu Leu Tyr Asp Ala 
3700 3705 3710 

Phe Ala Ala Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser 
3715 3720 3725 

Gly Val Trp Arg Leu Gly Asp Glu Leu Phe Ala Glu Val Arg Arg Pro 
3730 3735 3740 

Ala Gly Gly Ala Gly Thr Thr Gly Asp Gly Phe Gly Val His Pro Ala 
3745 3750 3755 3760 

Leu Phe Asp Ala Ala Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro 
3765 3770 ' 3775 

Asp Thr Gly Gly Thr Thr Trp Ala Pro Phe Ser Trp Gin Gly He Ala 
3780 3785 3790 

Leu His Thr Thr Gly Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala 
3795 3800 3805 

Ala Gly Gly Thr Glu Ser Ala Phe Ser Val Gin Ala Ala Asp Pro Ala 
3810 3815 3820 

Gly Thr Pro Val Leu Thr Leu Asp Ala Leu Leu Leu Arg Pro Val Thr 
3825 3830 3835 3840 

Leu Gly Arg Ala Asp Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin 
3845 3850 ~ 3855 

Pro Val Gly Gin Gly Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val 
3860 3865 3870 
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Leu Gly Gin Ala Ala Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala 
3875 3880 3885 

Asp Leu Thr Ala Leu Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val 
3890 3895 3900 

Pro Arg Leu Val Val Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly 
3905 3910 3915 3920 

Pro Val Leu Ala Asp Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp 
3925 3930 3935 

Asp Asp Asp Pro Leu Arg Val Ala Leu Gly Arg Gly Leu Thr Leu Val 
3940 3945 3950 

Arg Glu Trp Val Glu Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val 
3955 3960 3965 

Leu Thr Arg Gly Ala Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu 
3970 3975 3980 

Thr Gly Ala Ala Leu Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr 
3985 3990 3995 4000 

Pro Asp Arg Phe Thr Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg 

4005 4010 4015 

Ala Ala Leu Pro Arg Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu 
4020 4025 4030 

Arg Thr Gly Asp Val Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg 
4035 ~ 4040 4045 

Pro Ala Glu Thr Thr Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin 
4050 4055 4060 

Thr Gin Val Thr Ala Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val 
4065 4070 4075 4080 

Phe Asp Pro Ala Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
4085 4090 4095 

Gly Arg Arg Val Ala Ser His Leu Ala Arg Arg Tyr Gly Val Arg His 
4100 4105 4110 

Met Leu Leu Val Ser Arg Arg Gly Pro Asp Ala Pro Glu Ala Gly Pro 
4115 4120 4125 

Leu Glu Arg Glu Leu Ala Gly Leu Gly Val Thr Ala Thr Phe Leu Ala 
4130 4135 4140 

Cys Asp Leu Thr Asp He Glu Ala Val Arg Lys Ala Val Ala Ala Val 
4145 4150 4155 4160 

Pro Ser Asp His Pro Leu Thr Gly Val Val His Thr Ala Gly Val Leu 
4165 4170 4175 

Asp Asp Gly Ala Leu Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val 
4180 4185 4190 
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Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu 
4195 4200 4205 

Asp Arg Pro Leu Arg Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu 
4210 4215 4220 

Leu Gly Arg Pro Gly Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu 
4225 4230 4235 4240 

Asp Ala Leu Ala Gly Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser 
4245 4250 4255 

Leu Ala Trp Gly Leu Trp Asp Glu Gin Thr Gly Met Ala Gly Gly Leu 
4260 4265 4270 

Asp Glu Met Ala Leu Arg Val Leu Arg Arg Asp Gly He Ala Ala Met 
4275 4280 4285 

Pro Pro Glu Gin Gly Leu Glu Leu Leu Asp Leu Ala Leu Thr Gly His 
4290 4295 * 4300 

Arg Asp Gly Pro Ala Val Leu Val Pro Leu Leu Leu Asp Gly Ala Ala 
4305 4310 4315 * 4320 

Leu Arg Arg Thr Ala Lys Glu Arg Gly Ala Ala Thr Met Ser Pro Leu 
4325 4330 4335 

Leu Arg Ala Leu Leu Pro Ala Ala Leu Arg Arg Ser Gly Gly Ala Gly 
4340 4345 4350 

Ala Pro Ala Ala Ala Asp Arg His Gly Lys Glu Ala Asp Pro Gly Ala 

4355 4360 4365 

Gly Arg Leu Ala Gly Met Val Ala Leu Glu Ala Ala Glu Arg Ser Ala 
4370 4375 4380 

Ala Val Leu Glu Leu Val Thr Glu Gin Val Ala Glu Val Leu Gly Tyr 
43 *5 4390 4395 4400 

Ala Ser Ala Ala Glu He Glu Pro Glu Arg Pro Phe Arg Glu He Gly 
4405 4410 4415 

Val Asp Ser Leu Ala Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Leu 
4420 4425 " 4430 

Val Gly Leu Arg Leu Pro Thr Thr Leu Ser Phe Asp His Pro Thr Pro 
4435 4440 4445 

Lys Asp Met Ala Gin His He Asp Gly Gin Leu Pro Arg Pro Ala Gly 
4450 4455 4460 

Ala Ser Pro Ala Asp Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg 
4465 4470 4475 * 4480 

Ala Val Ala Leu Leu Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg 
4485 4490 4495 

Glu Gin Leu Val Gly Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr 
4500 4505 4510 

Gly Thr Ala Ala Pro Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro 
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10 



15 



20 



25 



4515 4520 4525 

Thr Val Thr Asp Arg Leu Asp Glu Ala Thr Asp Asp Glu lie Phe Ala 
4530 " 4535 4540 

Phe Leu Asp Glu Gin Leu 
4545 4550 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1996 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Met Thr Ala Glu Asn Asp Lys He Arg Ser Tyr Leu Lys Arg Ala Thr 
1 5 10 15 

Ala Glu Leu His Arg Thr Lys Ser Arg Leu Ala Glu Val Glu Ser Ala 
20 25 30 

Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Ala Gly Thr 
50 55 60 

Asp Ala Val Ser Ala Phe Pro Val Asp Arg Gly Trp Asp Val Glu Gly 
65 70 75 80 

Leu Tyr Asp Pro Asp Pro Glu Ala Val Gly Arg Ser Tyr Val Arg Glu 
85 90 95 

Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly 
100 105 110 

He Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu Leu 
115 " 120 125 

Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr Asp 
145 150 155 160 

Asp Tyr Gly Ser Arg Phe Asp Ser Ala Pro Pro Glu Tyr Glu Gly Tyr 
165 170 175 

so Leu Val Asn Gly Ser Ala Gly Ser He Ala Ser Gly Arg Val Ala Tyr 

180 185 190 

Ala Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser 
195 200 205 



30 



35 



40 



45 



55 



Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
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210 



215 



220 



Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro 
225 * 230 235 240 

Thr Val Leu Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 
260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 
275 280 285 

Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 " 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val He Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro He 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Ser Glu Arg Gin Gly Gin 
355 360 365 

Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin 
370 " ~ 375 380 

Ala Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg 
385 390 395 400 

His Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys 
405 410 415 



Val Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser 

420 425 430 

Trp Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe 
435 ~ 440 445 

Gly Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala 
450 455 460 

Glu Ala Gly Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp 
465 470 475 480 

Ala Val Thr Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly 
485 490 495 

Ala Leu Arg Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val 
500 505 510 

Gly Ala Asp Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala 
515 " 520 525 

Ser Phe Gly Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Leu 
530 535 540 
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10 



15 



Leu Ala Gly Leu Asp Val Val Ala Ala Gly Ala Pro Val Gly Val Ser 
545 550 555 560 

Gly Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg Gly Ser Ala Val 
565 570 " 575 

Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp 
580 585 A 590 

Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu 
595 600 605 

Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val Gly Gly Trp Ser 
610 615 620 

Leu Arg Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly 
625 630 635 640 



20 



Ala Asp Ala Gly Val Gly Ser Gly Val Gly Val Gly Gly Leu Leu Gly 
645 650 * 655 



Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu 
660 665 670 



25 



Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly 
675 680 685 



His Ser Val Gly Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser 
690 695 700 



30 



Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly 
705 710 715 720 



35 



Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser 
725 730 - 735 

Val Val Arg Gly Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala 
740 745 750 



40 



45 



Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val 
755 760 765 

Leu Glu Ser Val Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg 

770 775 780 

Arg Leu Asp Val Ser His Gly Phe His Ser Val Leu Met Glu Pro Val 
785 790 795 800 

Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val 
805 810 815 



50 



Arg Pro Gly Val Val Val Val Ser Ser Val Ser Gly Gly Val Val Gly 
820 825 830 

Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu 
835 840 845 



55 



Ala Val Arg Phe Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val 
850 855 860 
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Gly Thr Leu Val Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala 
865 870 875 880 

Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met 
885 890 895 

Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr 
900 905 910 

Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gly 
915 920 925 

Ser Thr Gly Arg Arg lie Asp Leu Pro Thr Tyr Pro Phe Gin His Asp 
930 " 935 940 

Arg Tyr Trp Leu Ala Ala Pro Ser Arg Pro Arg Thr Asp Gly Leu Ser 
945 * 950 955 960 

Ala Ala Gly Leu Arg Glu Val Glu His Pro Leu Leu Thr Ala Ala Val 
965 970 975 

Glu Leu Pro Gly Thr Asp Thr Glu Val Trp Thr Gly Arg lie Ser Ala 
980 985 990 

Ala Asp Leu Pro Trp Leu Ala Asp His Leu Val Trp Asp Arg Gly Val 
995 1000 1005 

Val Pro Gly Thr Ala Leu Leu Glu Thr Val Leu Gin Val Gly Ser Arg 
1010 1015 1020 

He Gly Leu Pro Arg Val Ala Glu Leu Val Leu Glu Thr Pro Leu Thr 
1025 1030 1035 1040 

Trp Thr Ser Asp Arg Pro Leu Gin Val Arg He Val Val Thr Ala Ala 
1045 1050 1055 

Ala Thr Ala Pro Gly Gly Ala Arg Glu Leu Thr Leu His Ser Arg Pro 
1060 1065 1070 

Glu Pro Val Ala Ala Ser Ser Ser Ser Pro Ser Pro Ala Ser Pro Arg 
1075 1080 1085 

His Leu Thr Ala Gin Glu Ser Asp Asp Asp Trp Thr Arg His Ala Ser 
1090 1095 1100 

Gly Leu Leu Ala Pro Ala Ala Gly Leu Ala Asp Asp Phe Ala Glu Leu 
1105 1110 1115 1120 

Thr Gly Ala Trp Pro Pro Val Gly Ala Glu Pro Leu Asp Leu Ala Gly 

1125 1130 1135 

Gin Tyr Pro Leu Phe Ala Ala Ala Gly Val Arg Tyr Glu Gly Ala Phe 
1140 1145 1150 

Arg Gly Leu Arg Ala Ala Trp Arg Arg Gly Asp Glu Val Phe Ala Asp 
1155 1160 1165 

Val Arg Leu Pro Asp Ala His Ala Val Asp Ala Asp Arg Tyr Gly Val 
1170 1175 1180 

His Pro Ala Leu Leu Asp Ala Val Leu His Pro He Ala Ser Leu Asp 
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1190 1195 1200 

Pro Leu Gly Asp Gly Gly His Gly Leu Leu Pro Phe Ser Trp Thr Asp 
1205 1210 1215 

Val Gin Gly His Gly Ala Gly Gly His Ala Leu Arg Val Arg Val Ala 
1220 1225 1230 

Ala Val Asp Gly Gly Ala Val Ser Val Thr Ala Ala Asp His Ala Gly 
1235 1240 1245 

Asn Pro Val Leu Ser Ala Arg Ser Leu Ala Leu Arg Arg He Thr Ala 
1250 1255 1260 

Asp Arg Leu Pro Ala Ala Pro Val Ala Pro Leu Tyr Arg Val Asp Trp 
1265 1270 1275 " 1280 

Leu Pro Phe Pro Gly Pro Val Pro Val Ser Ala Gly Gly Arg Trp Ala 
1285 1290 1295 

Val Val Gly Pro Glu Ala Glu Ala Thr Ala Ala Gly Leu Arg Ala Val 
1300 1305 1310 

Gly Leu Asp Val Arg Thr His Ala Leu Pro Leu Gly Glu Pro Leu Pro 
1315 1320 1325 

Pro Gin Ala Gly Thr Asp Ala Glu Val He He Leu Asp Leu Thr Thr 
1330 1335 1340 

Thr Ala Ala Gly Arg Thr Ala Ser Asp Gly Gly Arg Leu Ser Leu Leu 
1345 1350 1355 1360 

Asp Glu Val Arg Ala Thr Val Arg Arg Thr Leu Glu Ala Val Gin Ala 
1365 1370 1375 

Arg Leu Ala Asp Thr Glu Thr Ala Pro Asp Val Asp Val Arg Thr Ala 
1380 1385 1390 

Ala Arg Pro Arg Thr Ala Ala Arg Thr Ser Pro Arg Val Asp Thr Arg 
1395 1400 1405 

Thr Gly Ala Arg Thr Ala Asp Gly Pro Arg Leu Val Val Leu Thr Arg 
1410 1415 1420 

Gly Ala Ala Gly Pro Glu Gly Gly Ala Ala Asp Pro Ala Gly Ala Ala 
1425 1430 1435 1440 

Val Trp Gly Leu Val Arg Val Ala Gin Ala Glu Gin Pro Gly Arg Phe 
1445 1450 1455 

Thr Leu Val Asp Val Asp Gly Thr Gin Ala Ser Leu Arg Ala Leu Pro 
1460 1465 1470 

Gly Leu Leu Ala Thr Asp Ala Gly Gin Ser Ala Val Arg Asp Gly Ara 

1475 1480 1485 

Val Thr Val Pro Arg Leu Val Pro Val Ala Asp Pro Val Pro His Glv 
1490 1495 1500 

Gly Gly Thr Ala Ala Asp Gly Thr Gly Ala Gly Glu Pro Ser Ala Thr 
1505 1510 1515 1520 
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Leu Asp Pro Glu Gly Thr Val Leu lie Thr Gly Gly Thr Gly Ala Leu 
1525 1530 1535 

Ala Ala Glu Thr Ala Arg His Leu Val Asp Arg His Lys Val Arg His 
1540 1545 1550 

Leu Leu Leu Val Gly Arg Arg Gly Pro Asp Ala Pro Gly Val Asp Arg 
1555 1560 1565 

Leu Val Ala Glu Leu Thr Glu Ser Gly Ala Glu Val Ala Val Arg Ala 
1570 1575 1580 

Cys Asp Val Thr Asp Arg Asp Ala Leu Arg Arg Leu Leu Asp Ala Leu 
1585 1590 1595 1600 

Pro Asp Glu His Pro Leu Thr Cys Val Val His Thr Ala Gly Val Leu 
1605 1610 1615 

Asp Asp Gly Val Leu Ser Ala Gin Thr Ala Glu Arg lie Asp Thr Val 
1620 1625 1630 

Leu Arg Pro Lys Ala Asp Ala Ala Val His Leu Asp Glu Leu Thr Arg 
1635 1640 1645 

Glu lie Gly Arg Val Pro Leu Val Leu Tyr Ser Ser Val Ser Ala Thr 
1650 1655 1660 

Leu Gly Ser Ala Gly Gin Ala Gly Tyr Ala Ala Ala Asn Ala Phe Met 
1665 1670 1675 1680 

Asp Ala Leu Ala Ala Arg Arg Cys Ala Ala Gly His Pro Ala Leu Ser 
1685 1690 1695 

Leu Gly Trp Gly Trp Trp Ser Gly Val Gly Leu Ala Thr Gly Leu Asp 
1700 1705 1710 

Gly Ala Asp Ala Ala Arg Val Arg Arg Ser Gly Leu Ala Pro Leu Asp 
1715 1720 1725 

Ala Gly Ala Ala Leu Asp Leu Leu Asp Arg Ala Leu Thr Arg Pro Glu 
1730 1735 1740 

Pro Ala Leu Leu Pro Val Arg Leu Asp Leu Arg Ala Ala Ala Gly Ala 
1745 1750 1755 1760 

Thr Ala Leu Pro Glu Val Leu Arg Asp Leu Ala Gly Val Pro Ala Asp 
1765 1770 1775 

Ala Arg Ser Thr Pro Gly Ala Ala Ala Gly Thr Gly Asp Glu Asp Gly 
1780 1785 1790 

Ala Val Arg Pro Ala Pro Ala Pro Ala Asp Ala Ala Gly Thr Leu Ala 
1795 1800 1805 

Ala Arg Leu Ala Gly Arg Ser Ala Pro. Glu Arg Thr Ala Leu Leu Leu 
1810 1815 1820 

Asp Leu Val Arg Thr Glu Val Ala Ala Val Leu Gly His Gly Asp Pro 
1825 1830 1835 1840 
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Ala Ala He Gly Ala Ala Arg Thr Phe Lys Asp Ala Gly Phe Asp Ser 
1845 1850 1855 

Leu Thr Ala Val Asp Leu Arg Asn Arg Leu Asn Thr Arg Thr Gly Leu 
1860 1865 1870 

Arg Leu Pro Ala Thr Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu 
1875 1880 1885 

Ala Glu Leu Leu Leu Asp Gly Leu Glu Ala Ala Gly Pro Ala Glu Pro 
1890 1895 1900 

Ala Ala Glu Val Pro Asp Glu Ala Ala Gly Ala Glu Thr Leu Ser Gly 
1905 1910 1915 1920 

Val He Asp Arg Leu Glu Arg Ser Leu Ala Ala Thr Asp Asp Gly Asp 
1925 1930 ~ 1935 

Ala Arg Val Arg Ala Ala Arg Arg Leu Arg Gly Leu Leu Asp Ala Leu 
1940 1945 1950 

Pro Ala Gly Pro Gly Ala Ala Ser Gly Pro Asp Ala Gly Glu His Ala 
1955 1960 1965 

Pro Gly Arg Gly Asp Val Val He Asp Arg Leu Arg Ser Ala Ser Asp 
1970 1975 1980 

Asp Asp Leu Phe Asp Leu Leu Asp Ser Asp Phe Gin 
1985 1990 1995 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3724 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Ala Thr Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala 
1 5 10 15 

Met Ala Asp Leu His Ser Ala Arg Glu Arg Leu Arg Glu Val Glu Ser 
20 25 30 

Ala Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Val Ala Ser Pro Glu Glu Leu Trp Asp Leu Val Ala Ala Gly 
50 55 60 

Thr Asp Ala He Ser Pro Phe Pro Val Asp Arg Gly Trp Asp Ala Glu 
65 70 75 " 80 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg 
85 90 ~ ^ 95 
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Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 

100 105 110 

Gly He Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu 

115 120 125 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro 
130 135 140 

Ala Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr 
145 150 155 160 

His Asp Tyr Gly Ser His Gin Val Gly Thr Ala Ala Asp Pro Ser Gly 
165 170 175 

Gin Leu Gly Leu Gly Thr Ala Gly Ser Val Ala Ser Gly Arg Val Ala 
180 185 190 

Tyr Thr Leu Gly Leu Gin Gly Pro Ala Val Thr Met Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg 
210 215 220 

Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Leu Ala Thr 
225 230 235 240 

Pro Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
245 250 255 

Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala 
260 265 270 

Glu Gly Ala Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 

Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
290 295 300 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin 
305 310 315 320 

Arg Val He Arg Asp Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp 
325 330 335 

Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gly Ala Leu Met Ala Thr Tyr Gly Ser Glu Arg Val Gly 
355 360 365 

Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin 
370 375 380 

Ala Ala Ala Gly Ala Ala Gly Val He Lys Met Val Gin Ala Leu Arg 
385 390 395 400 



Gin Ser Glu Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ala Lys 
405 410 415 
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Val Glu Trp Asp Ala Gly Ala Val Gin Leu Leu Thr Gly Val Arg Pro 
420 425 430 

Trp Pro Arg Arg Glu His Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 

Gly Val Ser Gly Thr Asn Ala His Val He He Glu Glu Pro Pro Ala 

450 455 460 

Ala Gly Asp Thr Ser Pro Ala Gly Asp Thr Pro Glu Pro Gly Glu Ala 

465 470 475 ~ 480 

Thr Ala Ser Pro Ser Thr Ala Ala Gly Pro Ser Ser Pro Ser Ala Val 
485 490 495 

Ala Gly Pro Leu Ser Pro Ser Ser Pro Ala Val Val Trp Pro Leu Ser 
500 505 510 

Ala Glu Thr Ala Pro Ala Leu Arg Ala Gin Ala Ala Arg Leu Arg Ala 
515 520 525 

His Leu Glu Arg Leu Pro Gly Thr Ser Pro Thr Asp He Gly His Ala 
530 535 540 

Leu Ala Ala Glu Arg Ala Ala Leu Thr Arg Arg Val Val Leu Leu Gly 
545 550 555 560 

Asp Asp Gly Ala Pro Val Asp Ala Leu Ala Ala Leu Ala Ala Gly Glu 
565 570 575 

Thr Thr Pro Asp Ala Val His Gly Thr Ala Ala Asp He Arg Arg Val 
580 585 590 

Ala Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Ala Gly Met Gly Ala 
595 600 605 

Glu Leu Leu Asp Thr Ala Pro Ala Phe Ala Ala Glu Leu Asp Arg Cys 
610 615 620 



Gin Gly Ala Leu Ser Pro Tyr Val Asp Trp Asn Leu Ala Asp Val Leu 
625 630 635 640 

Arg Gly Ala Pro Ala Ala Pro Gly Leu Asp Arg Val Asp Val Val Gin 
645 650 * 655 

Pro Ala Thr Phe Ala Val Met Val Gly Leu Ala Ala Leu Trp Arg Ser 
660 665 670 

Leu Gly Val Glu Pro Ala Ala Val He Gly His Ser Gin Gly Glu He 
675 680 685 

Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg 
690 695 700 

He Val Ala Leu Arg Ser Gin Val He Ala Arg Glu Leu Ala Gly Ara 
705 710 715 720 

Gly Gly Met Ala Ser Val Ala Leu Pro Ala Ala Glu Val Glu Ala Arg 
725 730 735 

Leu Ala Gly Gly Val Glu He Ala Ala Val Asn Gly Pro Gly Ser Thr 
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740 745 750 

Val Val Cys Gly Glu Pro Gly Ala Leu Glu Ala Leu Leu Val Thr Leu 
755 760 765 

Glu Ser Glu Gly Thr Arg Val Arg Arg He Asp Val Asp Tyr Ala Ser 
770 775 780 

His Ser His Tyr Val Glu Ser lie Arg Ala Glu Leu Ala Thr Val Leu 
785 790 795 800 

Gly Pro Val Arg Pro Arg Arg Gly Asp Val Pro Phe Tyr Ser Thr Val 

805 810 815 

Glu Ala Ala Leu Leu Asp Thr Ala Thr Leu Asp Ala Asp Tyr Trp Tyr 
820 825 830 

Arg Asn Leu Arg Leu Pro Val Arg Phe Glu Pro Thr Val Arg Ala Met 
835 840 845 

Leu Asp Asp Gly Val Asp Ala Phe Val Glu Cys Ser Ala His Pro Val 
20 850 855 860 

Leu Thr Val Gly Val Arg Gin Thr Val Glu Ser Ala Gly Gly Ala Val 
865 870 875 880 



15 



25 



Pro Ala Leu Ala Ser Leu Arg Arg Asp Glu Gly Gly Leu Arg Arg Phe 
885 890 895 

Leu Thr Ser Ala Ala Glu Ala Gin Val Val Gly Val Pro Val Asp Trp 
900 905 910 

30 Ala Thr Leu Arg Pro Gly Ala Gly Arg Val Asp Leu Pro Thr Tyr Ala 

915 920 925 

Phe Gin Arg Glu Arg His Trp Val Gly Pro Ala Arg Pro Asp Ser Ala 
930 935 940 

35 Ala Thr Ala Ala Thr Thr Gly Asp Asp Ala Pro Glu Pro Gly Asp Arg 

945 950 955 960 

Leu Gly Tyr His Val Ala Trp Lys Gly Leu Arg Ser Thr Thr Gly Gly 
965 970 975 

Trp Arg Pro Gly Leu Arg Leu Leu He Val Pro Thr Gly Asp Gin Tyr 
980 985 990 

Thr Ala Leu Ala Asp Thr Leu Glu Gin Ala Val Ala Ser Phe Gly Gly 
995 1000 1005 

Thr Val Arg Arg Val Ala Phe Asp Pro Ala Arg Thr Gly Arg Ala Glu 
1010 1015 1020 

Leu Phe Gly Leu Leu Glu Thr Glu He Asn Gly Asp Thr Ala Val Thr 
so 1025 1030 1035 1040 

Gly Val Val Ser Leu Leu Gly Leu Cys Thr Asp Gly Arg Pro Asp His 
1045 1050 1055 

55 Pro Ala Val Pro Val Ala Val Thr Ala Thr Leu Ala Leu Val Gin Ala 

1060 1065 1070 



40 



45 
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Leu Ala Asp Leu Gly Ser Thr Ala Pro Leu Trp Thr Val Thr Cys Gly 
1075 1080 1085 

Ala Val Ala Thr Ala Pro Asp Glu Leu Pro Cys Thr Ala Gly Ala Gin 
1090 1095 1100 

Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Leu Pro Glu Val Trp 

1110 1115 1120 

Gly Gly Leu He Asp Leu Pro Ala Arg Pro Asp Ala Arg Val Leu Asp 
1125 1130 1135 

Arg Leu Ala Gly Val Leu Ala Glu Pro Gly Gly Glu Asp Gin He Ala 
1140 1145 1150 

Val Arg Met Ala Gly Val Phe Gly Arg Arg Val Leu Arg Asn Pro Ala 

1155 1160 1165 

Asp Ser Arg Pro Pro Ala Trp Arg Ala Arg Gly Thr Val Leu He Ala 

1170 1175 1180 

Gly Asp Leu Thr Thr Val Pro Gly Arg Leu Val Arg Ser Leu Leu Glu 
H85 1190 1195 1200 

Asp Gly Ala Asp Arg Val Val Leu Ala Gly Pro Asp Ala Pro Ala Gin 
1205 1210 1215 

Ala Ala Ala Ala Gly Leu Thr Gly Val Ser Leu Val Pro Val Arg Cys 
1220 1225 1230 

Asp Val Thr Asp Arg Ala Ala Leu Ala Ala Leu Leu Asp Glu His Ala 
30 1235 1240 1245 

Pro Thr Val Ala Val His Ala Pro Pro Leu Val Pro Leu Ala Pro Leu 
1250 1255 1260 



20 



25 



35 



Arg Glu Thr Ala Pro Gly Asp He Ala Ala Ala Leu Ala Ala Lys Thr 
1265 1270 1275 1280 

Thr Ala Ala Gly His Leu Val Asp Leu Ala Pro Ala Ala Gly Leu Asp 
1285 1290 1295 

40 Ala Leu Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Gly Ala Ala 

1300 1305 ~ 1310 

Gin Gly Gly Tyr Ala Ala Ala Ser Ala His Leu Asp Ala Leu Ala Glu 
1315 1320 1325 



45 



50 



55 



Arg Ala Arg Ala Ala Gly Val Pro Ala Phe Ser Val Ala Trp Ser Pro 
1330 1335 1340 

Trp Ala Gly Gly Thr Pro Ala Asp Gly Ala Glu Ala Glu Phe Leu Ser 
1345 1350 1355 1360 

Arg Arg Gly Leu Ala Pro Leu Asp Pro Asp Gin Ala Val Arg Thr Leu 
1365 1370 1375 

Arg Arg Met Leu Glu Arg Gly Ser Ala Cys Gly Ala Val Ala Asp Val 
1380 1385 1390 



159 



EP 0 791 655 A2 



35 



45 



55 



Glu Trp Ser Arg Phe Ala Ala Ser Tyr Thr Trp Val Arg Pro Ala Val 
1395 1400 1405 
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1570 1575 1580 

Gly Trp Asp Asn Glu Thr Leu Tyr Asp Pro Asp Pro Asp Ser Pro Gly 
1585 1590 1595 1600 

His His Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 
1605 1610 1615 

Phe Asp Pro Gly Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
40 1620 1625 1630 

Asp Pro Gin Gin Arg Leu He Leu Glu Thr Ser Trp Glu Ser Phe Glu 
1635 1640 1645 



Arg Ala Gly He Asp Pro Val Glu Leu Arg Gly Ser Arg Thr Gly Val 
1650 1655 1660 

Phe Val Gly Thr Asn Gly Gin His Tyr Val Pro Leu Leu Gin Asp Gly 
1665 1670 1675 1680 

so Asp Glu Asn Phe Asp Gly Tyr He Ala Thr Gly Asn Ser Ala Ser Val 

1685 1690 1695 

Met Ser Gly Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val 
1700 1705 1710 



Thr Val Asp Thr Ala Cys Ser Ala Ser Leu Ala Ala Leu His Leu Ala 
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1715 1720 1725 

Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Tyr Ala Leu Ala Gly Gly 
1730 1735 1740 

Ala Thr Val Met Ser Thr Pro Glu Met Leu Val Glu Phe Ala Arg Gin 
1745 1750 1755 1760 

Arg Ala Val Ser Pro Asp Gly Arg Ser Lys Ala Phe Ala Glu Ala Ala 
1765 1770 1775 

Asp Gly Val Gly Leu Ala Glu Gly Ala Gly Met Leu Leu Val Glu Arg 
1780 1785 1790 

Leu Ser Glu Ala Gin Lys Lys Gly His Pro Val Leu Ala Val Val Arg 
1795 1800 1805 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
1810 1815 1820 

Ser Gly Pro Ala Gin Gin Arg Val He Arg Glu Ala Leu Ala Asp Ala 
1825 1830 1835 1840 

Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly 
1845 1850 1855 

Thr Pro Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala Thr Tyr 

I860 1865 1870 

Gly Arg Asp Arg Arg Asp Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser 
1875 1880 1885 

Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys 
1890 1895 1900 

Met Val Leu Ala Leu Arg His Gly Glu Leu Pro Arg Thr Leu His Ala 
1905 1910 1915 1920 

Ser Thr Ala Ser Ser Arg He Asp Trp Asp Ala Gly Ala Val Glu Leu 
1925 1930 1935 

Leu Asp Glu Ala Arg Pro Trp Leu Gin Arg Ala Glu Gly Pro Arg Arg 
1940 1945 1950 

Ala Gly He Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu Val 
1955 i960 1965 

He Glu Glu Pro Pro Glu Pro Thr Ala Pro Glu Leu Leu Ala Pro Glu 
1970 1975 1980 

Pro Ala Ala Asp Gly Asp Val Trp Ser Glu Glu Trp Trp His Glu Val 
1985 1990 1995 2000 

Thr Val Pro Leu Met Met Ser Ala His Asn Glu Ala Ala Leu Arg Asp 
2005 2010 2015 

Gin Ala Arg Arg Leu Arg Ala Asp Leu Leu Ala His Pro Glu Leu His 
2020 2025 2030 

Pro Ala Asp Val Gly Tyr Thr Leu He Thr Thr Arg Thr Arg Phe Glu 
2035 2040 2045 
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Gin Arg Ala Ala Val Val Gly Glu Asn Phe Thr Glu Leu He Ala Ala 
2050 2055 2060 

Leu Asp Asp Leu Val Glu Gly Arg Pro His Pro Leu Val Leu Arg Gly 
2065 2070 2075 2080 

Thr Ala Gly Thr Ser Asp Gin Val Val Phe Val Phe Pro Gly Gin Gly 
2085 2090 2095 

Ser Gin Trp Pro Glu Met Ala Asp Gly Leu Leu Ala Arg Ser Ser Gly 
2100 2105 2110 

Ser Gly Ser Phe Leu Glu Thr Ala Arg Ala Cys Asp Leu Ala Leu Arg 
2115 2120 2125 

Pro His Leu Gly Trp Ser Val Leu Asp Val Leu Arg Arg Glu Pro Gly 
2130 2135 2140 

Ala Pro Ser Leu Asp Arg Val Asp Val Val Gin Pro Val Leu Phe Thr 
2145 2150 2155 2160 

Met Met Val Ser Leu Ala Glu Thr Trp Arg Ser Leu Gly Val Glu Pro 
2165 2170 2175 

Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Tyr Val 
2180 2185 2190 

Ala Gly Ala Leu Thr Leu Asp Asp Ala Ala Arg He Val Ala Leu Arg 
2195 2200 2205 

Ser Gin Ala Trp Leu Arg Leu Ala Gly Lys Gly Gly Met Val Ala Val 

2210 2215 2220 

Thr Leu Ser Glu Arg Asp Leu Arg Pro Arg Leu Glu Pro Trp Ser Asp 
2225 2230 2235 2240 

Arg Leu Ala Val Ala Ala Val Asn Gly Pro Glu Thr Cys Ala Val Ser 
2245 2250 2255 

Gly Asp Pro Asp Ala Leu Ala Glu Leu Val Ala Glu Leu Gly Ala Glu 
2260 2265 2270 

Gly Val His Ala Arg Pro He Pro Gly Val Asp Thr Ala Gly His Ser 
2275 2280 2285 

Pro Gin Val Asp Thr Leu Glu Ala His Leu Arg Lys Val Leu Ala Pro 
2290 2295 2300 

Val Ala Pro Arg Thr Ser Asp He Pro Phe Tyr Ser Thr Val Thr Gly 
2305 2310 2315 2320 

Gly Leu He Asp Thr Ala Glu Leu Asp Ala Asp Tyr Trp Tyr Arg Asn 
2325 2330 2335 

Met Arg Glu Pro Val Glu Phe Glu Gin Ala Thr Arg Ala Leu He Ala 
2340 2345 2350 

Asp Gly His Asp Val Phe Leu Glu Ser Ser Pro His Pro Met Leu Ala 
2355 2360 2365 
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15 



20 



Val Ser Leu Gin Glu Thr He Ser Asp Ala Gly Ser Pro Ala Ala Val 
2370 2375 2380 

Leu Gly Thr Leu Arg Arg Gly Gin Gly Gly Pro Arg Trp Leu Gly Val 
2385 2390 2395 2400 

Ala Leu Cys Arg Ala Tyr Thr His Gly Leu Glu He Asp Ala Glu Ala 
2405 2410 2415 

lie Phe Gly Pro Asp Ser Arg Gin Val Glu Leu Pro Thr Tyr Pro Phe 
2420 2425 2430 

Gin Arg Glu Arg Tyr Trp Tyr Ser Pro Gly His Arg Gly Asp Asp Pro 
2435 2440 2445 

Ala Ser Leu Gly Leu Asp Ala Val Asp His Pro Leu Leu Gly Ser Gly 
2450 2455 2460 

Val Glu Leu Pro Glu Ser Gly Asp Arg Met Tyr Thr Ala Arg Leu Gly 
2465 2470 2475 2480 

Ala Asp Thr Thr Pro Trp Leu Ala Asp His Ala Leu Leu Gly Ser Pro 
2485 2490 2495 

Leu Leu Pro Gly Ala Ala Phe Ala Asp Leu Ala Leu Trp Ala Gly Arg 
2500 2505 2510 

Gin Ala Gly Thr Gly Arg Val Glu Glu Leu Thr Leu Ala Ala Pro Leu 
2515 2520 2525 

Val Leu Pro Gly Ser Gly Gly Val Arg Leu Arg Leu Asn Val Gly Ala 
2530 2535 2540 

Pro Gly Thr Asp Asp Ala Arg Arg Phe Ala Val His Ala Arg Ala Glu 
2545 2550 2555 2560 

Gly Ala Thr Asp Trp Thr Leu His Ala Glu Gly Leu Leu Thr Ala Gin 

2565 2570 2575 

Asp Thr Ala Asp Ala Pro Asp Ala Ser Ala Ala Thr Pro Pro Pro Gly 
2580 2585 2590 

Ala Glu Gin Leu Asp He Gly Asp Phe Tyr Gin Arg Phe Ser Glu Leu 
40 2 5 9 5 2 6 0 0 2 6 0 5 

Gly Tyr Gly Tyr Gly Pro Phe Phe Arg Gly Leu Val Ser Ala His Arg 
2610 2615 2620 



25 



30 



35 



45 



Cys Gly Pro Asp He His Ala Glu Val Ala Leu Pro Val Gin Ala Gin 
26 25 2630 2635 2640 

Gly Asp Ala Ala Arg Phe Gly He His Pro Ala Leu Leu Asp Ala Ala 
2645 2650 2655 

so Leu Gin Thr Met Ser Leu Gly Gly Phe Phe Pro Glu Asp Gly Arg Val 

2660 2665 2670 

Arg Met Pro Phe Ala Leu Arg Gly Val Arg Leu Tyr Arg Ala Gly Ala 
2675 2680 2685 



55 



Asp Arg Leu His Val Arg Val Ser Pro Val Ser Glu Asp Ala Val Arg 
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2690 2695 2700 

lie Arg Cys Ala Asp Gly Glu Gly Arg Pro Val Ala Glu He Glu Ser 
2705 2710 2715 2720 

Phe He Met Arg Pro Val Asp Pro Gly Gin Leu Leu Gly Gly Arg Pro 
2725 2730 2735 

Val Gly Ala Asp Ala Leu Phe Arg He Ala Trp Arg Glu Leu Ala Ala 
2740 2745 2750 

Gly Pro Gly Thr Arg Thr Gly Asp Gly Thr Pro Pro Pro Val Arg Trp 
2755 2760 2765 

Val Leu Ala Gly Pro Asp Ala Leu Gly Leu Ala Glu Ala Ala Asp Ala 
2770 ~ 2775 2780 

His Leu Pro Ala Val Pro Gly Pro Asp Gly Ala Leu Pro Ser Pro Thr 
2785 2790 2795 2800 

Gly Arg Pro Ala Pro Asp Ala Val Val Phe Ala Val Arg Ala Gly Thr 
2805 2810 2815 

Gly Asp Val Ala Ala Asp Ala His Thr Val Ala Cys Arg Val Leu Asp 
2820 2825 2830 

Leu Val Gin Arg Arg Leu Ala Ala Pro Glu Gly Pro Asp Gly Ala Arg 
2835 2840 2845 

Leu Val Val Ala Thr Arg Gly Ala Val Ala Val Arg Asp Asp Ala Glu 
2850 2855 2860 

Val Asp Asp Pro Ala Ala Ala Ala Ala Trp Gly Leu Leu Arg Ser Ala 
2865 2870 2875 2880 

Gin Ala Glu Glu Pro Gly Arg Phe Leu Leu Val Asp Leu Asp Asp Asp 
2885 2890 2895 

Pro Ala Ser Ala Arg Ala Leu Thr Asp Ala Leu Ala Ser Gly Glu Pro 
2900 2905 2910 

Gin Thr Ala Val Arg Ala Gly Thr Val Tyr Val Pro Arg Leu Glu Arg 

2915 2920 2925 

Ala Ala Asp Arg Thr Asp Gly Pro Leu Thr Pro Pro Asp Asp Gly Ala 
2930 2935 2940 

Trp Arg Leu Gly Arg Gly Thr Asp Leu Thr Leu Asp Gly Leu Ala Leu 
2945 2950 2955 2960 

Val Pro Ala Pro Asp Ala Glu Ala Pro Leu Glu Pro Gly Gin Val Arg 
2965 2970 2975 

Val Ala Val Arg Ala Ala Gly Val Asn Phe Arg Asp Ala Leu He Ala 
2980 2985 2990 

Leu Gly Met Tyr Pro Gly Glu Ala Glu Met Gly Thr Glu Gly Ala Gly 
2995 3000 3005 

Thr Val Val Glu Val Gly Pro Gly Val Thr Gly Val Ala Val Gly Asp 
3010 3015 3020 
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Arg Val Leu Gly Leu Trp Asp Gly Gly Leu Gly Pro Leu Cys Val Ala 
3025 3030 3035 ~ 3040 

Asp His Arg Leu Leu Ala Pro Val Pro Asp Gly Trp Ser Tyr Ala Gin 
3045 3050 " 3055 

Ala Ala Ser Val Pro Ala Val Phe Leu Ser Ala Tyr Tyr Gly Leu Val 
3060 3065 ^ 3070 

Thr Leu Ala Gly Leu Arg Pro Gly Glu Arg Val Leu Val His Ala Ala 
3075 3080 3085 

Ala Gly Gly Val Gly Met Ala Ala Val Gin He Ala Arg His Leu Gly 
3090 3095 3100 

Ala Glu Val Leu Ala Thr Ala Ser Pro Gly Lys Trp Asp Ala Leu Ara 
31 °5 3110 3115 3120 

Ala Met Gly He Thr Asp Asp His Leu Ala Ser Ser Arg Thr Leu Asp 
3125 3130 3135 

Phe Ala Thr Ala Phe Thr Gly Ala Asp Gly Thr Ser Arg Ala Asp Val 
3140 3145 3150 

Val Leu Asn Ser Leu Thr Lys Glu Phe Val Asp Ala Ser Leu Gly Leu 
3155 3160 3165 

Leu Arg Pro Gly Gly Arg Phe Leu Glu Leu Gly Lys Thr Asp Val Arg 
3170 3175 3180 

Asp Pro Glu Arg He Ala Ala Glu His Pro Gly Val Arg Tyr Arg Ala 
3185 3190 3195 3200 

Phe Asp Leu Asn Glu Ala Gly Pro Asp Ala Leu Gly Arg Leu Leu Arg 
3205 3210 ~ 3215 

Glu Leu Met Asp Leu Phe Ala Ala Gly Val Leu His Pro Leu Pro Val 
3220 3225 3230 

Val Thr His Asp Val Arg Arg Ala Ala Asp Ala Leu Arg Thr He Ser 
3235 3240 3245 

Gin Ala Arg His Thr Gly Lys Leu Val Leu Thr Met Pro Pro Ala Trp 
3250 3255 3260 

His Pro Tyr Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
3265 3270 3275 " 3280 

Ser Arg He Ala Arg His Leu Ala Ser Arg His Gly Val Arg Arg Leu 
3285 3290 3295 

Leu He Ala Ala Arg Arg Gly Pro Asp Gly Glu Gly Ala Ala Glu Leu 
3300 3305 3310 

Val Ala Asp Leu Ala Ala Leu Gly Ala Ser Ala Thr Val Val Ala Cys 
3315 3320 3325 

Asp Val Ser Asp Ala Asp Ala Val Arg Gly Leu Leu Ala Gly He Pro 
3330 3335 3340 
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Ala Asp His Pro Leu Thr Ala Val Val His Ser Thr Gly Val Leu Asp 
3345 3350 3355 3360 

Asp Gly Val Leu Pro Gly Leu Thr Pro Glu Arg Met Arg Arg Val Leu 
3365 3370 3375 

Arg Pro Lys Val Glu Ala Ala Val His Leu Asp Glu Leu Thr Arg Asp 
3380 3385 3390 

Leu Asp Leu Ser Ala Phe Val Leu Phe Ser Ser Ser Ala Gly Leu Leu 
3395 3400 3405 

Gly Ser Pro Ala Gin Gly Asn Tyr Ala Ala Ala Asn Ala Thr Leu Asp 
3410 3415 3420 

'5 Ala Leu Ala Ala Arg Arg Arg Ser Leu Gly Leu Pro Ser Val Ser Leu 

3425 3430 3435 3440 

Ala Trp Gly Leu Trp Ser Asp Thr Ser Arg Met Ala His Ala Leu Asp 
3445 3450 3455 

20 

Gin Glu Ser Leu Gin Arg Arg Phe Ala Arg Ser Gly Phe Pro Pro Leu 
3460 3465 3470 

Ser Ala Thr Leu Gly Ala Ala Leu Phe Asp Ala Ala Leu Arg Val Asp 
3475 3480 3485 

Glu Ala Val Gin Val Pro Met Arg Phe Asp Pro Ala Ala Leu Arg Ala 
3490 3495 3500 

Thr Gly Ser Val Pro Ala Leu Leu Ser Asp Leu Val Gly Ser Ala Pro 
3505 3510 3515 3520 

Ala Thr Gly Ser Ala Ala Pro Ala Ser Gly Pro Leu Pro Ala Pro Asp 
3525 3530 3535 

Ala Gly Thr Val Gly Glu Pro Leu Ala Glu Arg Leu Ala Gly Leu Ser 
35 " 3540 3545 3550 

Ala Glu Glu Arg His Asp Arg Leu Leu Gly Leu Val Gly Glu His Val 
3555 ~ 3560 3565 

40 Ala Ala Val Leu Gly His Gly Ser Ala Ala Glu Val Arg Pro Asp Arg 

3570 3575 3580 

Pro Phe Arg Glu Val Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg 
3585 3590 3595 3600 



25 



30 



45 



Asn Arg Met Ala Ala Val Thr Gly Val Arg Leu Pro Ala Thr Leu Val 
3605 3610 3615 



Phe Asp His Pro Thr Pro Ala Ala Leu Ser Ser His Leu Asp Gly Leu 
3620 3625 3630 

50 Leu Ala Pro Ala Gin Pro Val Thr Thr Thr Pro Leu Leu Ser Glu Leu 

3635 3640 3645 

Asp Arg He Glu Glu Ala Leu Ala Ala Leu Thr Pro Glu His Leu Ala 
3650 3655 3660 



55 



Glu Leu Ala Pro Ala Pro Asp Asp Arg Ala Glu Val Ala Leu Arg Leu 
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3665 3670 3675 3680 

Asp Ala Leu Ala Asp Arg Trp Arg Ala Leu His Asp Gly Ala Pro Gly 
3685 3690 3695 

Ala Asp Asp Asp lie Thr Asp Val Leu Ser Ser Ala Asp Asp Asp Glu 
3700 3705 3710 

He Phe Ala Phe He Asp Glu Arg Tyr Gly Thr Ser 
3715 3720 



(2) INFORMATION FOR SEQ ID NO ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1580 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Asn Glu Glu Lys Leu Arg Ala Tyr Leu Lys Arg Val Thr Gly 
15 10 15 

Glu Leu His Arg Ala Thr Glu Gin Leu Arg Ala Leu Asp Arg Arg Ala 
20 25 30 

His Glu Pro He Ala He Val Gly Ala Ala Cys Arg Leu Pro Gly Gly 
35 40 45 

Val Glu Ser Pro Asp Asp Leu Trp Glu Leu Leu His Ala Gly Ala Asp 
50 55 60 

Ala Val Gly Pro Ala Pro Ala Asp Arg Gly Trp Asp Val Glu Gly Arg 
65 70 75 80 

Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Ser Tyr Cys Arg Glu Gly 
85 90 95 

Gly Phe Val Gin Gly Ala Asp Arg Phe Asp Pro Ala Leu Phe Gly He 
100 105 HO 

Ser Pro Asn Glu Ala Leu Thr Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Leu Asp Pro Gin Ser 
130 135 140 

Leu Ala Gly Ser Arg Thr Gly Val Phe Ala Gly Ala Trp Glu Ser Gly 
145 150 155 160 

Tyr Gin Lys Gly Val Glu Gly Leu Glu Ala Asp Leu Glu Ala Gin Leu 

165 170 175 

Leu Ala Gly He Val Ser Phe Thr Ala Gly Arg Val Ala Tyr Ala Leu 
180 185 190 

Gly Leu Glu Gly Pro Ala Leu Thr He Asp Thr Ala Cys Ser Ser Ser 
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195 



200 



205 



Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
210 215 220 

Asp Leu Ala Leu Ala Gly Gly Ala Thr Val He Ala Asp Phe Ala Leu 
225 230 235 240 

Phe Thr Gin Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys 
245 250 255 

Lys Ala Phe Gly Glu Thr Ala Asp Gly Phe Gly Pro Ala Glu Gly Ala 
260 265 270 

Gly Met Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
275 280 285 

Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
290 295 300 

Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val He 
305 310 315 320 

Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val Asp Ala 
325 330 335 

Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro He Glu Ala 
340 345 350 

Gly Ala Leu Met Ala Thr Tyr Gly His Glu Arg Thr Gly Asp Pro Leu 
355 360 365 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala 
370 375 380 

Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu Arg His Gly Glu 
385 390 395 400 

Leu Pro Arg Thr Leu His Ala Ser Thr Ala Ser Ser Arg He Glu Trp 
405 410 415 

Asp Ala Gly Ala Val Glu Leu Leu Asp Glu Ala Arg Pro Trp Pro Arg 
420 425 430 

Arg Ala Glu Gly Pro Arg Arg Ala Gly He Ser Ser Phe Gly He Ser 
435 440 445 

Gly Thr Asn Ala His Leu Val He Glu Glu Glu Pro Pro Ala Arg Pro 
450 455 460 

Glu Pro Glu Glu Ala Ala Gin Pro Pro Ala Pro Ala Thr Thr Val Leu 
465 470 475 480 

Pro Leu Ser Ala Ala Gly Ala Arg Ser Leu Arg Glu Gin Ala Arg Arg 
485 490 495 

Leu Ala Ala His Leu Ala Gly His Glu Glu He Thr Ala Ala Asp Ala 
500 505 510 



Ala Arg Ser Ala Ala Thr Thr Arg Ala Ala Leu Ser His Arg Ala Ser 
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515 520 525 

Val Leu Ala Asp Asp Arg Arg Ala Leu lie Asp Arg Leu Thr Ala Leu 
530 535 540 

Ala Glu Asp Arg Lys Asp Pro Gly Val Thr Val Gly Glu Ala Gly Ser 
545 550 555 560 

Gly Arg Pro Pro Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Thr 
565 570 575 

Gly Met Gly Ala Glu Leu Leu Asp Arg Ala Pro Val Phe Arg Ala Lys 
580 585 590 

Ala Glu Glu Cys Ala Arg Ala Leu Ala Ala His Leu Asp Trp Ser Val 
595 600 605 

Leu Asp Val Leu Arg Asp Ala Pro Gly Ala Pro Pro lie Asp Arg Ala 
610 615 620 

Asp Val Val Gin Pro Thr Leu Phe Thr Met Met Val Ser Leu Ala Ala 
625 630 635 640 

Leu Trp Glu Ser His Gly Val Arg Pro Ala Ala Val Val Gly His Ser 
645 650 655 

Gin Gly Glu lie Ala Ala Ala His Ala Ala Gly Ala Leu Ser Leu Asp 
660 665 670 

Asp Ala Ala Arg Val lie Ala Glu Arg Ser Arg Leu Trp Lys Arg Leu 
675 680 685 

Ala Gly Asn Gly Gly Met Leu Ser Val Met Ala Pro Ala Asp Arg Val 
690 695 700 

Arg Glu Leu Met Glu Pro Trp Ala Glu Arg Met Ser Val Ala Ala Val 

705 710 715 720 

Asn Gly Pro Ala Ser Val Thr Val Ala Gly Asp Ala Arg Ala Leu Glu 
725 730 735 

Glu Phe Gly Gly Arg Leu Ser Ala Ala Gly Val Leu Arg Trp Pro Leu 
740 745 750 

Ala Gly Val Asp Phe Ala Gly His Ser Pro Gin Val Glu Gin Phe Arg 
755 760 765 

Ala Glu Leu Leu Asp Thr Leu Gly Thr Val Arg Pro Thr Ala Ala Arg 
770 775 780 

Leu Pro Phe Phe Ser Thr Val Thr Ala Ala Ala His Glu Pro Glu Gly 
785 790 795 800 

Leu Asp Ala Ala Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 
805 810 815 

Ala Ser Thr Leu Arg Thr Leu Leu Arg Glu Gly His Arg Thr Phe Val 
820 825 830 



Glu Met Gly Pro His Pro Leu Leu Gly Ala Ala lie Asp Glu Val Ala 
835 840 845 
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Glu 


Ala 
850 


Glu Gly 


Val 


His 


Ala 
855 


Thr 


Ala 


Leu Ala 


Thr 
860 


Leu 


His 


Arg 


Gly 


5 


Ser 
865 
His 


Gly 
Gly 


Gly Leu 
Val Arg 


Asp 

Val 
885 


Arg 
870 
Asp 


Phe 
Trp 


Arg 
Asp 


Ser 
Ala 


Ser Val 
875 
Leu Phe 
890 


Gly 
Glu 


Ala 
Gly 


Ala Phe 

Ser Gly 
895 


Ala 
880 
Ala 


1U 


Arg 


Arg 


Val Pro 
900 


Leu 


Pro 


Thr 


Tyr 


Ala 
905 


Phe Ser 


Arg 


Asp 


Arg Tyr 
910 


Trp 




Leu 


Pro 


Thr Ala 
915 


He 


Gly 


Arg 


Arg 
920 


Ala 


Val Glu 


Ala 


Ala 

925 


Pro 


Val 


Asp 


15 


Ala 


Ser 
930 


Ala Pro 


Gly 


Arg 


Tyr 
935 


Arg 


Val 


Thr Trp 


Thr 
940 


Pro 


Val 


Ala 


Ser 




Asp 
945 


Asp 


Ser Gly 


Arg 


Pro 
950 


Ser 


Gly 


Arg 


Trp Leu 
955 


Leu 


Val 


Gin 


Thr 


Pro 

OCA 

960 


20 


Gly 


Thr 


Ala Pro 


Asp 
965 


Glu 


Ala 


Asp 


Thr 


Ala Ala 
97 0 


Ser 


Ala 


Leu Gly 
975 


Ala 


25 


Ala 


Gly 


Val Val 
980 


Val 


Glu 


Arg 


Cys 


Leu 
985 


Leu Asp 


Pro 


Thr 


Glu 
990 


Ala 


Ala 




Arg 


Val 


Thr Leu 
995 


Thr 


Glu 


Arg 


Leu Ala 
1000 


Glu Leu 


Asp 


Ala Gin 
1005 


Pro 


Glu 


30 


Gly 


Leu Ala Gly 
1010 


Val 


Leu 


Val Leu 
1015 


Pro 


Gly Arg 


Pro Gin 
1020 


Ser 


Thr 


Ala 




Pro Ala 
1025 


Asp Ala 


Ser 


Pro Leu 
1030 


Asp 


Pro 


Gly Thr Ala 
1035 


Ala 


Val 


Leu 


Leu 
1040 


35 


Val 


Val 


Gin Ala 


Val Pro 
1045 


Asp 


Ala 


Ala 


Pro Lys 
1050 


Ala 


Arg 


He 


Trp Val 
1055 




Val 


Thr 


Arg Gly Ala 
1060 


Val 


Ala 


Val 


Gly Ser Gly 
1065 


Glu 


Val 


Pro Cys 
1070 


Ala 


40 


Val Gly 


Ala Arg 
1075 


Val 


Trp 


Gly 


Leu Gly 
1080 


Arg Val 


Ala 


Ala Leu 
1085 


Glu 


Val 




Pro Val Gin Trp 
1090 


Gly 


Gly 


Leu Val 
1095 


Asp 


Val Ala 


Val Gly 
1100 


Ala 


Gly 


Val 


45 


Arg Glu Trp Arg 
1105 


Arg 


Val Val 
1110 


Gly 


Val 


Val Ala Gly 
1115 


Gly 


Gly 


Glu 


Asp 
1120 




Gin Val 


Ala Val 


Arg 


Gly Gly Gly 


Val 


Phe Gly Arg 


Arg 


Leu 


Val 


Gly 



1125 1130 1135 

so 

Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg Ala Arg Gly Cys Val 
1140 1145 1150 

Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly His Val Ala Arg Trp 
5s 1155 1160 1165 
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Leu Ala Arg Ser Gly Ala Glu His Val Val Leu Ala Gly Arg Arg Gly 
1170 1175 1180 

Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg Glu Leu Val Gly Leu 
1185 1190 1195 1200 

Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val Gly Asp Arg Ala Ser 
1205 1210 1215 

Met Val Gly Leu Leu Gly Val Val Glu Gly Leu Gly Val Pro Leu Arg 

1220 1225 * 1230 

Gly Val Phe His Ala Ala Gly Val Ala Gin Val Ser Gly Leu Gly Glu 
1235 1240 1245 

Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly Gly Lys Ala Val Gly 
1250 1255 1260 

Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val Glu Leu Asp Ala Phe 
1265 1270 1275 1280 

Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly Ser Gly Gly Gin Ser 
1285 1290 1295 

Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala Leu Ala Glu Arg Arg 
1300 1305 1310 

Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Gly 
1315 1320 1325 

Gly Glu Gly Met Gly Ala Asp Glu Gly Val Thr Glu Phe Tyr Ala Glu 
1330 1335 1340 

Arg Gly Leu Ala Pro Met Arg Pro Glu Ser Gly He Glu Ala Leu His 
1345 1350 1355 1360 

Thr Ala Leu Asn Glu Gly Asp Thr Cys Val Thr Val Ala Asp He Asp 
1365 1370 1375 

Trp Glu His Phe Val Thr Gly Phe Thr Ala Tyr Arg Pro Ser Pro Leu 
1380 1385 1390 

He Ser Asp He Pro Gin Val Arg Ala Leu Arg Thr Pro Glu Pro Thr 
1395 1400 1405 

Val Asp Ala Ser Asp Gly Leu Arg Arg Arg Val Asp Ala Ala Leu Thr 
1410 1415 1420 

Pro Arg Glu Arg Thr Lys Val Leu Val Asp Leu Val Arg Thr Val Ala 
1425 1430 1435 ~ 1440 

Ala Glu Val Leu Gly His Asp Gly He Gly Gly He Gly His Asp Val 
1445 1450 1455 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg Met Arg 
1460 1465 1470 

Gly Arg Leu Ala Glu Ala Thr Gly Leu Val Leu Pro Ala Thr Val He 
1475 1480 1485 

Phe Asp His Pro Thr Val Asp Arg Leu Gly Gly Ala Leu Leu Glu Arg 
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1490 1495 1500 

Leu Ser Ala Asp Glu Pro Ala Pro Gly Gly Ala Pro Glu Pro Ala Gly 
1505 1510 1515 1520 

Gly Arg Pro Ala Thr Pro Pro Pro Ala Pro Glu Pro Ala Val His Asp 
1525 1530 1535 

Ala Asp lie Asp Glu Leu Asp Ala Asp Ala Leu He Arg Leu Ala Thr 
1540 1545 1550 

Gly Thr Ala Gly Pro Ala Asp Gly Thr Pro Ala Asp Gly Gly Pro Asp 
1555 1560 1565 

Ala Ala Ala Thr Ala Pro Asp Gly Ala Pro Glu Gin 
1570 1575 1580 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1891 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Pro Ser Met Asp Glu Val Leu Gly Ala Leu Arg Thr Ser Val 
15 10 15 

Lys Glu Thr Glu Arg Leu Arg Arg His Asn Arg Glu Leu Leu Ala Gly 
20 25 30 

Ala His Glu Pro Val Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ser Thr Pro Asp Asp Leu Trp Glu Leu Ala Ala Asp Gly Val 
50 55 60 

Asp Ala He Thr Pro Phe Pro Ala Asp Arg Gly Trp Asp Glu Asp Ala 
65 70 75 80 

Val Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Thr Tyr Cys Arg Glu 
85 90 95 

Gly Gly Phe Leu Thr Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly 
100 105 110 

He Ser Pro Asn Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu 
115 120 125 

Leu Glu Thr Ser Trp Glu Thr Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Ala Ala His Thr 
145 " 150 155 160 
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Gly Tyr Val Thr Asp Thr Ala Arg Ala Pro Glu Gly Thr Glu Gly Tyr 
165 170 175 

Leu Leu Thr Gly Asn Ala Asp Ala Val Met Ser Gly Arg lie Ala Tyr 
180 185 190 

Ser Leu Gly Leu Glu Gly Pro Ala Leu Thr lie Gly Thr Ala Cys Ser 
195 200 205 

Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
210 215 220 

Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Ala Val Met Pro Asp Pro 
225 230 235 240 

Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Val Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 

260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 
275 280 285 

Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val lie Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp 
355 360 365 

Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
370 375 380 

Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg His 
385 390 395 400 

Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val 
405 410 415 

Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Gly Arg Ser Trp 
420 425 430 

Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly 
435 440 445 

Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Val Glu 
450 455 460 



Ala Gly Ser Glu His Gly Asp Gly Pro Gly Pro Asp Arg Pro Asp Ala 
465 470 475 " 480 

Val Thr Gly Pro Leu Pro Trp Val Leu Ser Ala Arg Ser Arg Glu Ala 
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485 



490 



495 



Leu Arg Gly Gin Ala Gly Arg Leu Ala Ala Leu Ala Arg Gin Gly Arg 
500 505 510 

Thr Glu Gly Thr Gly Gly Gly Ser Gly Leu Val Val Pro Ala Ala Asp 
515 520 525 

lie Gly Tyr Ser Leu Ala Thr Thr Arg Glu Thr Leu Glu His Arg Ala 
530 ~ 535 540 

Val Ala Leu Val Gin Glu Asn Arg Thr Ala Gly Glu Asp Leu Ala Ala 
545 550 555 560 

Leu Ala Ala Gly Arg Thr Pro Glu Ser Val Val Thr Gly Val Ala Arg 
565 570 575 

Arg Gly Arg Gly lie Ala Phe Leu Cys Ser Gly Gin Gly Ala Gin Arg 
580 585 590 

Leu Gly Ala Gly Arg Glu Leu Arg Gly Arg Phe Pro Val Phe Ala Asp 
595 600 605 

Ala Leu Asp Glu lie Ala Ala Glu Phe Asp Ala His Leu Glu Arg Pro 

610 " 615 620 

Leu Leu Ser Val Met Phe Ala Glu Pro Ala Thr Pro Asp Ala Ala Leu 

625 630 635 640 

Leu Asp Arg Thr Asp Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Thr 
645 650 655 

Ala Leu Phe Arg Leu Leu Glu Ser Trp Gly Leu Val Pro Asp Val Leu 
660 665 670 

Val Gly His Ser He Gly Gly Leu Val Ala Ala His Val Ala Gly Val 
675 680 685 

Phe Ser Ala Ala Asp Ala Ala Arg Leu Val Ser Ala Arg Gly Arg Leu 
690 695 700 

Met Arg Ala Leu Pro Glu Gly Gly Ala Met Ala Ala Val Gin Ala Thr 
705 710 715 720 

Glu Arg Glu Ala Ala Ala Leu Glu Pro Val Ala Ala Gly Gly Ala Val 
725 730 735 

Val Ala Ala Val Asn Gly Pro Gin Ala Leu Val Leu Ser Gly Asp Glu 
740 745 750 

Ala Ala Val Leu Ala Ala Ala Gly Glu Leu Ala Ala Arg Gly Arg Arg 
755 760 765 

Thr Lys Arg Leu Arg Val Ser His Ala Phe His Ser Pro Arg Met Asp 
770 775 780 

Ala Met Leu Ala Asp Phe Arg Ala Val Ala Asp Thr Val Asp Tyr His 
785 790 795 800 



Ala Pro Arg Leu Pro Val Val Ser Glu Val Thr Gly Asp Leu Ala Asp 
805 810 815 
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Ala Ala Gin Leu Thr Asp Pro Gly Tyr Trp Thr Arg Gin Val Arg Gin 
820 825 830 

Pro Val Arg Phe Ala Asp Ala Val Arg Thr Ala Ser Ala Arg Asp Ala 
835 " 840 845 

Ala Thr Phe lie Glu Leu Gly Pro Asp Ala Val Leu Cys Gly Met Ala 
850 855 860 

Glu Glu Ser Leu Ala Ala Glu Ala Asp Val Val Phe Ala Pro Ala Leu 
865 870 875 880 

Arg Arg Gly Arg Pro Glu Gly Asp Thr Val Leu Arg Ala Ala Ala Ser 
885 890 895 

Ala Tyr Val Arg Gly Ala Gly Leu Asp Trp Ala Ala Leu Tyr Gly Gly 
900 905 910 

Thr Gly Ala Arg Arg Thr Asp Leu Pro Thr Tyr Ala Phe Gin His Ser 
915 920 925 

Arg Tyr Trp Leu Ala Pro Ala Ser Ala Ala Val Ala Pro Ala Thr Ala 
930 935 940 

Ala Pro Ser Val Arg Ser Val Pro Glu Ala Glu Gin Asp Gly Ala Leu 
945 950 955 960 

Trp Ala Ala Val His Ala Gly Asp Val Ala Ser Ala Ala Ala Arg Leu 

965 970 975 

Gly Ala Asp Asp Ala Gly He Glu His Glu Leu Arg Ala Val Leu Pro 
980 985 990 

His Leu Ala Ala Trp His Asp Arg Asp Arg Ala Thr Ala Arg Thr Ala 
995 1000 1005 

Gly Leu His Tyr Arg Val Thr Trp Gin Ala He Glu Ala Asp Ala Val 
1010 1015 1020 

Arg Phe Ser Pro Ser Asp Arg Trp Leu Met Val Glu His Gly Gin His 
1025 1030 1035 1040 

Thr Glu Cys Ala Asp Ala Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala 
1045 1050 1055 

Glu Val Thr Arg Leu Val Trp Pro Leu Glu Gin His Thr Gly Ser Pro 
1060 1065 1070 

Arg Thr Glu Thr Pro Asp Arg Gly Thr Leu Ala Ala Arg Leu Ala Glu 
1075 1080 1085 

Leu Ala Arg Ser Pro Glu Gly Leu Ala Gly Val Leu Leu Leu Pro Asp 
1090 1095 1100 

Ser Gly Gly Ala Ala Val Ala Gly His Pro Gly Leu Asp Gin Gly Thr 
1105 1110 1115 1120 

Ala Ala Val Leu Leu Thr He Gin Ala Leu Thr Asp Ala Ala Val Arg 
1125 1130 1135 
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w 



15 



20 



25 



Ala Pro Leu Trp Val Val Thr Arg Gly Ala Val Ala Val Gly Ser Gly 
1140 1145 1150 

Glu Val Pro Cys Ala Val Gly Ala Arg Val Trp Gly Leu Gly Arg Val 
1155 1160 1165 

Ala Ala Leu Glu Val Pro Val Gin Trp Gly Gly Leu Val Asp Val Ala 
1170 1175 1180 

Val Gly Ala Gly Val Arg Glu Trp Arg Arg Val Val Gly Val Val Ala 
1185 ~ 1190 1195 1200 

Gly Gly Gly Glu Asp Gin Val Ala Val Arg Gly Gly Gly Val Phe Gly 
1205 1210 1215 

Arg Arg Leu Val Gly Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg 
1220 1225 1230 

Ala Arg Gly Cys Val Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly 
1235 1240 1245 

His Val Ala Arg Trp Leu Ala Arg Ser Gly Ala Glu His Val Val Leu 
1250 1255 1260 

Ala Gly Arg Arg Gly Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg 
1265 ~ " 1270 1275 1280 

Glu Leu Val Gly Leu Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val 
1285 1290 1295 

Gly Asp Arg Ala Ser Val Val Gly Leu Leu Gly Val Val Glu Gly Leu 
1300 1305 1310 

Gly Val Pro Leu Arg Gly Val Phe His Ala Ala Gly Val Ala Gin Val 

1315 1320 1325 

Ser Gly Leu Gly Glu Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly 
1330 1335 1340 

Gly Lys Ala Val Gly Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val 
1345 1350 1355 1360 

Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly 
1365 1370 1375 

Ser Gly Gly Gin Ser Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala 
1380 1385 1390 

Leu Ala Glu Arg Arg Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala 
1395 1400 1405 

Trp Gly Pro Trp Asp Gly Asp Gly Met Gly Glu Met Ala Pro Glu Gly 
1410 1415 1420 

50 Tyr Phe Ala Arg His Gly Val Ala Pro Leu His Pro Glu Thr Ala Leu 

1425 1430 1435 1440 

Thr Ala Leu His Gin Ala He Asp Gly Gly Glu Ala Thr Val Thr Val 
1445 1450 1455 



30 



35 



40 



45 



55 



Ala Asp He Asp Trp Glu Arg Phe Ala Pro Gly Phe Thr Ala Phe Arg 
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1460 1465 1470 

Pro Ser Pro Leu lie Ala Gly lie Pro Ala Ala Arg Thr Ala Pro Ala 
1475 1480 1485 

Ala Gly Arg Pro Ala Glu Asp Thr Pro Thr Ala Pro Gly Leu Leu Arg 
1490 1495 1500 

Ala Arg Pro Glu Asp Arg Pro Arg Leu Ala Leu Asp Leu Val Leu Arg 
1505 1510 1515 1520 

His Val Ala Ala Val Leu Gly His Ser Glu Asp Ala Arg Val Asp Ala 
1525 1530 1535 

Arg Ala Pro Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg 
1540 1545 1550 

Leu Arg Arg Arg Leu Ala Glu Asp Thr Gly Leu Asp Leu Pro Gly Thr 
1555 1560 1565 

Leu Val Phe Asp His Glu Asp Pro Thr Ala Leu Ala His His Leu Ala 
1570 1575 1580 

Gly Leu Ala Asp Ala Gly Thr Pro Gly Pro Gin Glu Gly Thr Ala Arg 
1585 1590 1595 1600 

Ala Glu Ser Gly Leu Phe Ala Ser Phe Arg Ala Ala Val Glu Gin Arg 
1605 1610 1615 

Arg Ser Ser Glu Val Val Glu Leu Met Ala Asp Leu Ala Ala Phe Arg 
1620 1625 1630 

Pro Ala Tyr Ser Arg Gin His Pro Gly Ser Gly Arg Pro Ala Pro Val 
1635 1640 1645 

Pro Leu Ala Thr Gly Pro Ala Thr Arg Pro Thr Leu Tyr Cys Cys Ala 
1650 1655 1660 

Gly Thr Ala Val Gly Ser Gly Pro Ala Glu Tyr Val Pro Phe Ala Glu 
1665 1670 1675 1680 

Gly Leu Arg Gly Val Arg Glu Thr Val Ala Leu Pro Leu Ser Gly Phe 
1685 1690 1695 

Gly Asp Pro Ala Glu Pro Met Pro Ala Ser Leu Asp Ala Leu He Glu 
1700 1705 1710 

Val Gin Ala Asp Val Leu Leu Glu His Thr Ala Gly Lys Pro Phe Ala 
1715 1720 1725 

Leu Ala Gly His Ser Ala Gly Ala Asn He Ala His Ala Leu Ala Ala 
1730 1735 1740 

Arg Leu Glu Glu Arg Gly Ser Gly Pro Ala Ala Val Val Leu Met Asp 
1745 1750 1755 1760 

Val Tyr Arg Pro Glu Asp Pro Gly Ala Met Gly Glu Trp Arg Asp Asp 
1765 1770 1775 

Leu Leu Ser Trp Ala Leu Glu Arg Ser Thr Val Pro Leu Glu Asp His 
1780 1785 1790 
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20 



25 



30 



TTC TGG GAA CTG CTG CGC AGC GGA CGC GGT ATG CCC ACC CGT CAG GAC 
Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg Gin Asp 
35 40 45 50 

GAC GGC ACC TGG CGG GCC GCC CTG GAG GAC CAC GCC GGC TTC GAC GCC 
Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe Asp Ala 
55 60 65 

GGG TTC TTC GGC ATG AAC GCC CGG CAG GCC GCC GCC ACC GAC CCG CAG 
Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp Pro Gin 
70 75 80 

CAC CGA CTG ATG CTG GAA CTC GGA TGG GAG GCA CTG GAG GAC GCG GGC 
His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp Ala Gly 
85 90 95 

ATC GTC CCC GGC GAT CTC ACC GGC ACC GAC ACC GGA GTC TTC GCC GGC 
lie Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe Ala Gly 
100 105 HO 



GTG GCG TCC GAC GAC TAT GCC GTT CTC ACC CGC CGT TCC GCC GTC TCC 
Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala Val Ser 
115 120 125 130 

GCC GGG GGA TAC ACC GCC ACG GGG CTG CAC CGC GCC CTG GCC GCC AAC 
Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala Ala Asn 
135 140 145 

CGC CTC TCC CAC TTC CTG GGC CTG CGC GGC CCC AGC CTG GTC GTC GAC 
Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Val Asp 
150 155 160 

TCG GCC CAG TCC GCC TCA CTG GTG GCC GTC CAG CTC GCC TCC GAG AGT 
Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser 
165 170 175 

CTG CGC CGG GGT GAG ACG TCG CTC GCC GTC GCG GGC GGT GTC AAC CTC 
Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu 
180 185 190 

ATC CTC ACC GAG GAG AGC ACC ACC GTC ATG GAG CGT ATG GGA GCG CTC 
lie Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu 
i95 200 205 210 

TCA CCC GAC GGC CGC TGC CAC ACC TTC GAC GCC CGC GCC AAC GGC TAC 
Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr 
215 220 225 

GTA CGC GGC GAG GGC GGC GGA GCC GTC GTG CTC AAG CCA CTG GAC GCC 
Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala 
230 235 240 

GCA CTC GCC GAC GGC GAC CGC GTG TAC TCC GTC ATC AAG GGA GGT GCC 
Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly Gly Ala 
245 250 255 

GTC AAC AAC GAC GGC GGC GGC GCG AGC CTC ACC ACT CCC GAC CGg'gAG 
Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu 
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GCG CAG GAA GCT GTG CTG CGC CAG GCC TAC CGG CGG GCG GGC GTC AGC 1219 
Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly Val Ser 
5 275 280 285 290 

ACC GGC GCC GTC CGC TAC GTC GAG CTG CAC GGG ACC GGC ACC CGG GCC 1267 
Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala 
295 300 305 

10 GGC GAC CCC GTC GAG GCG GCC GCA CTG GGC GCC GTG CTC GGG GCG GGG 1315 

Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly 
310 315 320 

GCG GAC AGC GGC CGC AGC ACG CCG CTC GCC GTC GGC TCG GTG AAG ACC 1363 
15 Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr 

325 ' 330 335 

AAC GTC GGC CAT CTG GAG GGC GCG GCG GGC ATC GTC GGA CTG ATC AAG 1411 
Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu He Lys 
20 3 4 0 3 4 5 3 5 0 

GCC ACG CTG TGC GTA CGG AAG GGC GAA CTC GTC CCC AGC CTC AAC TTC 1459 
Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe 
355 360 365 370 

25 AGC ACG CCG AAC CCT GAC ATC CCC CTC GAC GAC CTG CGG CTG CGC GTC 1507 

Ser Thr Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu Arg Val 
375 380 385 

CAG ACC GAA CGG CAG GAG TGG AAC GAG GAG GAC GAC CGG CCG CGC GTG 1555 
Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val 
390 395 400 

GCC GGC GTC TCC TCC TTC GGT ATG GGC GGA ACC AAT GTC CAC CTC GTG 1603 

Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val 
405 410 415 

35 

ATC GCG GAG GCT CCG GCC GCG GCG GGG TCC TCC GGG GCG GGG GGT TCG 1651 

He Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser 
420 425 430 

GGC GCT GGT TCC GGT GCC GGT ATC AGC GCT GTT TCT GGT GTG GTG CCG 1699 

41/ 

Gly Ala Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val Val Pro 
435 440 445 450 

GTG GTG GTT TCG GGG CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC 1747 
Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly 
45 455 460 465 

CGG TTG GCG GAG GTG GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG 1795 
Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val 
470 475 480 

so 

GCG GTG ACG ATG GCG GAC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG 1843 
Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val 
485 490 495 

CTG GCT CGG GGT GAG GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG 1891 
55 Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala 
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10 



500 505 510 

GGG GGT GAT CCG GAC GCG GGT GTG GTC ACC GGT GCG GTT CTC GAC GGT 1939 
Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu Asp Gly 
515 520 525 530 

GGT GTG GTT GTC GGT GCT GCC CCC GGC GGT GCC GGT GCT GCC GGT GGT 1987 
Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly 
535 540 545 

GCC GGT GCT GCC GGT GGT GCC GGT GGT GGG GGC GTG GTG TTG GTT TTC 203 5 

Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe 
550 555 560 

CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG GGG CTG CTG GGG 2083 
Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
565 570 575 

TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT GCG CGG GCG CTG 2131 
Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 
580 585 590 

AGT GTT CAT GTG GGG TGG GAT TTG CTG GAG GTG GTG TCG GGC GGG GCC 2179 
Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 
595 600 605 610 

25 GGG TTG GAG CGG GTG GAT GTG GTG CAG CCG GTG ACG TGG GCG GTG ATG 2227 

Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 
615 620 625 



15 



20 



30 



35 



40 



45 



50 
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GTG TCG CTG GCC CGG TAC TGG CAG GCG ATG GGT GTG GAC GTG GCT GCG 2275 
Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
630 635 640 

GTG GTG GGT CAT TCC CAG GGG GAG ATC GCC GCT GCC ACG GTG GCG GGG 2323 
Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly 
645 650 655 

GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT CTG CGG GCG GGG 2371 
Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly 
660 665 670 

TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG GCG GCT GTT CCG 2419 
Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 
675 680 685 690 

CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG TGG CCG GGT GTG 2467 
Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 
695 700 705 

GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTG GTT TCC GGG GAT 2515 
Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
710 715 720 

CGG CGG GCG GTG GCC GGT TAT GTG GCC GTC TGT CAG GCG GAG GGT GTG 2563 
Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
725 730 735 

CAG GCC CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC TCC CGC CAT GTG 2611 
Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
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740 745 750 

GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC 2659 
Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly lie Arg Pro 
5 755 * 760 765 770 

CGC AGT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG 2707 
Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
775 780 785 

10 GGC GAG CCG GTT TTC GAT GCG GGG TAT TGG TTC CGT AAT CTG CGG AAC 2755 

Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
790 795 800 

CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG GAG GAG GGC CAC 2803 
is Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 

805 810 815 

CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTC CAT GCG ATC 2851 
Arg Arg Phe lie Glu Val Ser Ala His Pro Val Leu Val His Ala He 
2Q 820 825 830 

GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT GCC ACC GGG ACC 2899 
Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
835 840 845 850 

25 CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTG ACC TCC ACC GCC 2947 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
855 860 865 

GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TGG GAC CCC GCC CTG CCC 2995 
Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 
870 875 880 

CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC 3043 

Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 
885 890 895 

35 

TAC TGG CTC GAC ACC ATT GAC GGG GGC GGA GGG GAC GAC GCG ACC CAG 3091 

Tyr Trp Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin 
900 905 910 

40 GAG AAG GAG AGC GGC CCT CTG ACG CGG GAA CTG CGT GGG CTG CCG TCC 3139 

Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser 
915 " 920 925 930 

TCT CAG AAG CAA CTG GGT TTC CTG CTC GAT CTG GTG TGC CGG CAC ACG 3187 
Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr 
45 ~ 935 940 945 

GCC GTC GTA CTC GGC CTG GAC ACG GCC GCC GAG GTG GAC CCG GAC CTG 3235 
Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu 
950 955 960 

50 

TCC TTC AAG AAG CAG GGC ATC CAG TCC ATG ACC GGC GTC GAG CTG CGC 3283 
Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg 
965 970 975 

AAC AGG CTG CTG ACC GAG ACC GGC CTG GCA TTG CCC ACC ACC CTC GTC 3331 
55 Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val 
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10 



980 985 990 

TAC GAC CGG CCC ACC CCT CGC GCC CTG GCG CAG TTC CTC CAC ACC GAG 3379 
Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu 
995 1000 1005 1010 

TTG CTC GAC GGC TCC CCC TCG GGC TCC GTC CTC GCA CCG GCG CAG AAG 3427 
Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys 
1015 1020 1025 

AGC TTC GAG GCC GGC GGG CCC GGA GTG CTC TCG TCG GCC GCG GTA GGG 3475 
Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala Val Gly 
1030 1035 1040 

GTG TCG GAC GCC CGG GGC GGC AGC CGG GAC GAC GAC GAC CCG ATC GCC 3523 
15 Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro lie Ala 

1045 1050 1055 

ATC GTG GGT GTC GGC TGC CGG CTC CCC GGC GGC GTC GAC TCG CGC GCC 3571 
He Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Asp Ser Arg Ala 
20 1060 1065 1070 

GCT CTC TGG GAG CTG CTG GAG TCC GGC GCC GAC GCC ATC TCG TCC TTC 3619 
Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala He Ser Ser Phe 
1075 1080 1085 1090 

25 CCC ACC GAC CGC GGC TGG GAC CTC GAC GGG CTG TAC GAC CCC GAG CCC 3 667 

Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro Glu Pro 
1095 1100 1105 

GGG ACG CCC GGC AAG ACC TAT GTG CGG GAG GGC GGG TTC CTG CAC TCG 3715 
30 Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Ser 

1110 1115 1120 

GCG GCC GAG TTC GAC GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC 3763 

Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala 
1125 1130 1135 

35 

ACG GCC ATG GAC CCG CAG CAG CGC TTG CTG CTG GAA GCG TCG TGG GAG 3811 

Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser Trp Glu 
1140 1145 1150 

40 GCC CTC GAG GAC GCC GGA GTG CTC CCC GAG TCA CTG CGC GGC GGC GAC 3859 

Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly Gly Asp 
1155 1160 1165 1170 

GCC GGA GTG TTC GTC GGC GCC ACC GCA CCG GAG TAC GGG CCG AGG CTT 3907 
Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro Arg Leu 
45 1175 1180 1185 

CAC GAG GGA GCG GAC GGA TAC GAG GGG TAC CTG CTC ACC GGC ACC ACC 3955 
His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly Thr Thr 
1190 1195 1200 

50 

GCG AGC GTG GCC TCC GGC CGG ATC GCC TAC ACC CTC GGC ACC GGC GGA 4003 
Ala Ser Val Ala Ser Gly Arg He Ala Tyr Thr Leu Gly Thr Gly Gly 
1205 1210 1215 

55 CCG GCG CTC ACC GTC GAC ACC GCG TGC TCC TCG TCC CTG GTG GCG CTG 4051 

Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
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1220 1225 1230 

CAC CTG GCC GTG CAG GCG CTG CGC CGG GGC GAG TGC GGG CTG GCT CTG 4099 
His Leu Ala Val Gin Ala Leu Arg Arg Gly Glu Cys Gly Leu Ala Leu 
1235 1240 1245 1250 

GCG GGC GGC GCC ACG GTG ATG TCG GGG CCC GGC ATG TTC GTG GAG TTC 4147 
Ala Gly Gly Ala Thr Val Met Ser Gly Pro Gly Met Phe Val Glu Phe 
1255 1260 1265 

TCG CGG CAG CGC GGG CTC GCC CCC GAC GGC CGC TGC ATG CCG TTC TCC 4195 
Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Met Pro Phe Ser 
1270 1275 1280 

GCC GAT GCC GAC GGT ACG GCC TGG TCC GAG GGT GTC GCC GTA CTG GCA 4243 
Ala Asp Ala Asp Gly Thr Ala Trp Ser Glu Gly Val Ala Val Leu Ala 
1285 1290 1295 

CTG GAG CGG CTC TCC GAC GCC CGG CGT GCG GGA CAC CGG GTG CTG GGC 4291 
Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly His Arg Val Leu Gly 
1300 1305 1310 

GTG GTG CGG GGC AGT GCG GTC AAC CAG GAC GGT GCC AGC AAC GGC CTG 4339 
Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 
1315 1320 1325 1330 

25 ACC GCT CCC AAC CGC TCC GCG CAG GAG GGC GTC ATC CGA GCT GCC CTG 4387 

Thr Ala Pro Asn Arg Ser Ala Gin Glu Gly Val lie Arg Ala Ala Leu 
1335 1340 1345 
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GCC GAC GCC GGC CTC GCG CCG GGT GAC GTG GAC GCG GTG GAG GCG CAC 443 5 

Ala Asp Ala Gly Leu Ala Pro Gly Asp Val Asp Ala Val Glu Ala His 
1350 1355 1360 

GGT ACG GGG ACG GCG CTG GGC GAT CCG ATC GAG GCG AGC GCG CTG CTG 4483 
Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Ser Ala Leu Leu 
1365 1370 1375 

GCC ACG TAC GGG CGT GAG CGG GTG GGC GAC CCC TTG TGG CTC GGG TCG 4531 
Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser 
1380 1385 1390 

CTG AAG TCC AAC GTC GGT CAC ACC CAG GCC GCC GCG GGG GCC GCG GGT 4579 
Leu Lys Ser Asn Val Gly His Thr Gin Ala Ala Ala Gly Ala Ala Gly 
1395 1400 1405 1410 

GTG GTC AAG ATG CTG CTT GCC CTG GAG CAC GGC ACG CTG CCG CGG ACA 4627 
Val Val Lys Met Leu Leu Ala Leu Glu His Gly Thr Leu Pro Arg Thr 
45 1415 1420 1425 

CTT CAC GCG GAC CGG CCC AGC ACG CAC GTC GAC TGG TCG TCG GGC ACC 4675 
Leu His Ala Asp Arg Pro Ser Thr His Val Asp Trp Ser Ser Gly Thr 
1430 1435 1440 
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55 



GTC GCC CTG CTG GCA GAG GCG CGC CGG TGG CCC CGG CGG TCG GAC CGC 4723 
Val Ala Leu Leu Ala Glu Ala Arg Arg Trp Pro Arg Arg Ser Asp Arg 
1445 1450 1455 

CCG CGC CGG GCG GCT GTG TCG TCG TTC GGG ATC AGT GGG ACG AAC GCG 4771 
Pro Arg Arg Ala Ala Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala 
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w 



1460 1465 1470 

CAT CTG ATC ATC GAG GAG GCG CCG GAG TGG GTC GAG GAC ATC GAC GGC 4819 
His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He Asp Gly 
1475 1480 1485 1490 

GTC GCT GCT CCT GAC CGC GGT ACC GCG GAC GCG GCT GCT CCG TCG CCG 4867 
Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro Ser Pro 
1495 1500 1505 

CTG TTG TTG TCC GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTG 4915 
Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val 
1510 1515 1520 

CGG TTG GGT GAG TAC GTG GAG CGG GTG GGT GCG GAT CCG CGG GAT GTG 4963 
Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val 
1525 1530 1535 

GCT TAT TCG CTG GCT TCG ACG CGG ACT CTT TTC GAG CAC CGT GCG GTG 5011 
Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg Ala Val 
1540 1545 1550 

GTG CCG TGT GGT GGG CGT GGG GAG CTC GTC GCT GCT CTT GGT GGG TTT 5059 
Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly Gly Phe 
1555 1560 1565 1570 

25 GCT GCC GGG AGG GTG TCT GGG GGT GTG CGG TCC GGG CGG GCT GTG CCG 5107 

Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala Val Pro 
1575 1580 1585 
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GGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG CAG TGG GTT GGT 5155 
Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly 
1590 1595 1600 

ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT GCG GAG GTG CTG 5203 
Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu 
1605 1610 1615 

GAT GAG GTG TTG TCG ATG GTG GGG GAG GTG GAT GGT CGG TCG TTG CGG 5251 
Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser Leu Arg 
1620 1625 1630 

GAT GTG ATG TTC GGC GAC GTC GAC GTG GAC GCG GGT GCC GGG GCT GAT 5299 
Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly Ala Asp 
1635 1640 1645 1650 

GCG GGT GCC GGT GCG GGT GCT GGG GTC GGT TCT GGT TCC GGT TCT GTG 5347 
Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly Ser Val 
1655 1660 1665 

GGT GGG TTG TTG GGT CGG ACG GAG TTT GCT CAG CCT GCG CTG TTT GCG 5395 
Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala 
1670 1675 1680 

TTG GAG GTG GCG TTG TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTG 5443 
Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val 
1685 1690 1695 

TCG GTG GTG TTG GGT CAT TCG GTG GGG GAG GTG GCT GCT GCG TAT GTG 5491 
Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val 
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1700 1705 1710 

GCG GGG GTG TTG TCG TTG GGT GAT GCG GTG CGG TTG GTG GTG GCG CGG 5539 
Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg 
1715 1720 1725 1730 

GGT GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG 5587 
Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val 
1735 1740 1745 

GGG GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG 5635 
Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu 
1750 1755 1760 

TGG GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG 5683 
'5 Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser 

1765 1770 1775 

GGT GAT GTG GGT GTG CTG GAG TCG GTG GTT GCC TCG CTG ATG GGG GAT 5731 
Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met Gly Asp 
2Q 1780 1785 1790 

GGG GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTG 5779 
Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val 
1795 1800 1805 1810 

25 TTG ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG 5827 

Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu 
1815 1820 1825 
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GAG TTC GGT CGG GTG CGG CCG GGT GTG GTG GTG GTG TCG GGT GTG TCG 5875 
Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser 
1830 1835 1840 

GGT GGG GTG GTG GGT TCG GGG GAG TTG GGG GAT CCG GGG TAT TGG GTG 5923 
Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val 
1845 1850 1855 

CGT CAT GCG CGG GAG GCG GTG CGT TTC GCG GAT GGG GTG GGG GTG GTG 5971 
Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val Val 
1860 1865 1870 

CGT GGT CTG GGT GTG GGG ACG TTG GTG GAG GTG GGT CCG CAT GGG GTG 6019 
Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly Val 
1875 1880 1885 1890 



CTG ACG GGG ATG GCG GGT GAG TGC CTG GGG GCC GGT GAT GAT GTG GTG 6067 
Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Val 
45 1895 1900 1905 

GTG GTG CCG GCG ATG CGG CGG GGC CGT GCG GAG CGG GAG GTG TTC GAG 6115 
Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Glu 
1910 1915 1920 



55 



GCG GCG CTG GCG ACG GTG TTC ACC CGG GAC GCC GGC CTG GAC GCC ACG 6163 

Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr 
1925 1930 1935 

GCA CTC CAC ACC GGG AGC ACC GGC CGG CGC ATC GAC CTC CCC ACC TAC 6211 

Ala Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro Thr Tyr 
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1940 1945 1950 

CCC TTC CAA CGC GAC CGC TAC TGG CTG GAC CCC GTT CGC ACC GCC GTG 
Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr Ala Val 
1955 1960 1965 1970 

ACC GGC GTC GAG CCC GCC GGC TCG CCG GCG GAC GCT CGG GCC ACT GAG 
Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala Thr Glu 
1975 1980 1985 

CGG GGA CGG TCG ACG ACG GCC GGG ATC CGC TAC CGC GTC GCT TGG CAG 
Arg Gly Arg Ser Thr Thr Ala Gly He Arg Tyr Arg Val Ala Trp Gin 
1990 1995 2000 

CCG GCC GTC GTC GAC CGC GGC AAC CCC GGG CCT GCC GGT CAT GTG CTG 
Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His Val Leu 
2005 2010 2015 

CTT CTG GCC CCG GAC GAG GAC ACG GCC GAC TCC GGA CTC GCC CCC GCG 
Leu Leu Ala Pro Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala Pro Ala 
2020 2025 2030 

ATC GCA CGT GAA CTC GCC GTG CGC GGG GCC GAG GTC CAC ACC GTC GCC 
He Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr Val Ala 
2035 2040 2045 2050 

GTG CCG GTC GGT ACA GGC CGG GAG GCA GCC GGG GAC CTG TTG CGG GCC 
Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu Arg Ala 
2055 2060 2065 

GCC GGT GAC GGT GCC GCC CGC AGC ACC CGA GTT CTG TGG CTC GCC CCG 
Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu Ala Pro 
2070 2075 2080 

GCC GAG CCG GAC GCG GCC GAC GCC GTC GCC CTC GTC CAG GCG CTG GGC 
Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala Leu Gly 
2085 2090 2095 

GAG GCG GTA CCC GAA GCC CCG CTC TGG ATC ACC ACC CGT GAG GCG GCG 
Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu Ala Ala 
2100 2105 2110 

GCC GTG CGG CCG GAC GAG ACC CCT TCC GTC GGG GGC GCT CAG CTG TGG 
Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin Leu Trp 
2115 2120 2125 2130 

GGA CTC GGA CAG GTC GCC GCG CTC GAA CTG GGG CGG CGC TGG GGC GGC 
Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp Gly Gly 
2135 2140 ~ 2145 

TTG GCG GAC CTG CCC GGG AGT GCG TCG CCC GCG GTG CTC CGT ACG TTC 
Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg Thr Phe 
2150 2155 2160 



GTC GGG GCG CTG CTC GCC GGG GGA GAG AAC CAG TTC GCG GTA CGG CCC 
Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val Arg Pro 
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2180 2185 2190 

GCC TCC GCT CGC ACC GTC ACC ACG GCC CCC GCC ACC GCC GTC GGC GAG 6979 
Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val Gly Glu 
5 2195 2200 2205 2210 

GAC GCA CGG AAC GAC ACC TCG GAC GTG GTC GTG CCG GAC GAC CGG TGG 7027 
Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp Arg Trp 
2215 2220 2225 

10 TCC TCC GGC ACC GTA CTG ATC ACC GGG GGC ACC GGT GCC CTG GGT GCG 7 075 

Ser Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu Gly Ala 
2230 2235 2240 

CAG GTC GCC CGC AGG CTC GCC CGG TCG GGC GCC GCG CGT CTG CTC CTG 7123 
15 Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Arg Leu Leu Leu 

2245 2250 2255 

GTG GGC CGG CGC GGC GCG GCC GGC CCC GGA GTG GGC GAA CTC GTC GAG 7171 
Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu Val Glu 
2260 2265 2270 

20 

GAG CTG ACG GCG CTC GGT TCC GAA GTG GCC GTC GAG GCC TGC GAC GTC 7219 
Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys Asp Val 
2275 2280 2285 2290 

25 GCC GAC CGG GAC GCA CTG GCC GCG CTC CTC GCG GGC CTC CCC GAG GAG 7267 

Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro Glu Glu 
2295 2300 2305 

CGG CCC CTC GTC GCC GTA CTG CAC GCG GCA GGT GTG CTC GAC GAC GGT 7315 
Arg Pro Leu Val Ala Val Leu His Ala Ala Gly Val Leu Asp Asp Gly 
2310 2315 2320 

GTG CTC GAC TCG CTC ACC TCC GAC CGG GTG GAC GCC GTA CTG CGG GAC 7363 
Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu Arg Asp 
2325 2330 2335 

35 

AAG GTC ACC GCC GCC CGT CAC CTG GAC GAG CTG ACC GCG GAC CTT CCG 7411 
Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp Leu Pro 
2340 2345 2350 

CTC GAC GCC TTC GTG CTC TTC TCC TCC ATC GTC GGC GTG TGG GGC AAC 7459 

40 

Leu Asp Ala Phe Val Leu Phe Ser Ser He Val Gly Val Trp Gly Asn 
2355 2360 2365 2370 

GGA GGG CAG GCC GTC TAC GCG GCC GCC AAC GCC GCG CTC GAC GCC CTG 7507 
Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu 
45 2375 2380 2385 

GCG CAG CGG CGC CGG GCC AGG GGA GCC CGT GCC GCC TCG ATC GCC TGG 7555 
Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser He Ala Trp 
2390 2395 2400 



50 



55 



GGG CCG TGG GCC GGT GCC GGA ATG GCC TCC GGA ACG GCG GCG AAG TCC 7603 
Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala Lys Ser 
2405 2410 2415 

TTC GAA CGG GAC GGC GTC ACG GCC CTG GAC CCC GAG CGC GCG CTC GAC 7651 
Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala Leu Asp 
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2420 2425 2430 



5 


GTC CTC 
Val Leu 
2435 


GAC 
Asp 


GAC GTG GTG GGC 
Asp Val Val Gly 
2440 


GCC GGC GGG ACC TCT 
Ala Gly Gly Thr Ser 
2445 


GCC 
Ala 


GCA GGG ACG 
Ala Gly Thr 
2450 


7699 


10 


t-AC 

His 


GCG 

Ala 


GCC 
Ala 


GGC GAG AGC 
Gly Glu Ser 
2455 


TCC 
Ser 


CTG CTC GTC GCC GAC 
Leu Leu Val Ala Asp 
2460 


GTG 
Val 


GAC TGG GAG 
Asp Trp Glu 
2465 


7747 


ACC 
Thr 


TTC 
Phe 


GTC 
Val 


GGG CGT TCG 
Gly Arg Ser 
2470 


GTC 
Val 


ACC CGC CGT ACC TGG 
Thr Arg Arg Thr Trp 
2475 


TCG 
Ser 


CTC TTC GAC 
Leu Phe Asp 
2480 


7795 


15 


GGC 
Gly 


GTC 
Val 


TCC GCC 
Ser Ala 
2485 


GCC CGT 
Ala Arg 


TCG 
Ser 


GCG CGT GCC GGC CAT 
Ala Arg Ala Gly His 
2490 


GCC GCG GAC GAC 
Ala Ala Asp Asp 
2495 


7843 


20 


CGT 
Arg 


GCC GCT 
Ala Ala 
2500 


CTC 
Leu 


ACC CCA 
Thr Pro 


GGG ACG CGG CCG GGC GAC GGC 
Gly Thr Arg Pro Gly Asp Gly 
2505 2510 


GCA 
Ala 


CCG GGC 
Pro Gly 


7891 




GGG AGC 
Gly Ser 
2515 


GGA 
Gly 


CAG 
Gin 


GAC GGG GGC 
Asp Gly Gly 
2520 


GAG GGC CGG CCG TGG CTC 
Glu Gly Arg Pro Trp Leu 
2525 


TCC 
Ser 


GTC GGC 
Val Gly 
2530 


7939 


25 


CCC 
Pro 


TCG 
Ser 


CCG 
Pro 


GCG 
Ala 


GAA CGC 
Glu Arg 
2535 


CGT 
Arg 


CGT GCT CTG CTC ACG 
Arg Ala Leu Leu Thr 
2540 


CTT 
Leu 


GTG 
Val 


CGC TCG 
Arg Ser 
2545 


7987 




GAG 


GCC 


GCC 


GGG 


ATC CTG 


CGC 


CAC GCC TCG GCC GAC 


GCG 


GTC 


GAC CCG 


8035 



30 Glu Ala Ala G1 y He Leu Arg His Ala Ser Ala Asp Ala Val Asp Pro 

2550 2555 2560 

GAG CTG GCC TTC CGG TCC GCC GGG TTC GAC TCC CTC ACC GTT CTC GAA 8083 

Glu Leu Ala Phe Arg Ser Ala Gly Phe Asp Ser Leu Thr Val Leu Glu 
2565 2570 2575 

35 

CTG CGT AAC CGC CTG ACC GCT GCC ACC GGC CTG AAC CTG CCG AAC ACG 8131 

Leu Arg Asn Arg Leu Thr Ala Ala Thr Gly Leu Asn Leu Pro Asn Thr 
2580 2585 2590 

40 CTG CTC TTC GAC CAC CCG ACC CCC CTC TCG CTC GCC TCC CAC CTG CAC 8179 

Leu Leu Phe Asp His Pro Thr Pro Leu Ser Leu Ala Ser His Leu His 
2595 2600 2605 2610 

GAC GAA CTG TTC GGT CCC GAC AGC GAG GCG GAG CCG GCA GCG GCC GCC 8227 
Asp Glu Leu Phe Gly Pro Asp Ser Glu Ala Glu Pro Ala Ala Ala Ala 
2615 2620 2625 

CCC ACG CCG GTC ATG GCC GAC GAG CGT GAG CCG ATC GCG ATC GTG GGC 8275 

Pro Thr Pro Val Met Ala Asp Glu Arg Glu Pro He Ala He Val Gly 
2630 2635 2640 

50 

ATG GCG TGC CGT TAC CCG GGC GGT GTG GCG TCG CCG GAC GAC CTG TGG 8323 

Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Ser Pro Asp Asp Leu Trp 
2645 2650 2655 

55 GAC CTG GTG GCC GGT GAC GGG CAC ACG CTC TCC CCG TTC CCG GCC GAC 8371 

Asp Leu Val Ala Gly Asp Gly His Thr Leu Ser Pro Phe Pro Ala Asp 
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2660 2665 2670 

CGT GGC TGG GAC GTC GAG GGG CTG TAC GAC CCG GAG CCG GGG GTG CCG 8419 
Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro 
2675 2680 2685 2690 

GGC AAG AGC TAT GTA CGG GAA GGC GGG TTC CTG CGT TCC GCG GCC GAG 8467 
Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala Ala Glu 
2695 2700 2705 

TTC GAC GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC ACG GCC ATG 8515 
Phe Asp Ala Glu Phe Phe Gly lie Ser Pro Arg Glu Ala Thr Ala Met 
2710 2715 2720 

GAC CCG CAG CAG CGG TTG CTG CTG GAG ACG TCG TGG GAG GCG CTG GAG 8563 
is Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu 

2725 2730 2735 

CGG GCC GGC ATC GTT CCG GAC TCG CTG CGC GGC ACC CGG ACC GGT GTC 8611 

Arg Ala Gly lie Val Pro Asp Ser Leu Arg Gly Thr Arg Thr Gly Val 

2740 2745 2750 

20 

TTC AGC GGC ATC TCC CAG CAG GAC TAC GCG ACC CAG CTG GGG GAC GCC 8659 

Phe Ser Gly lie Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly Asp Ala 

2755 2760 2765 2770 

25 GCC GAC ACC TAC GGC GGG CAT GTG CTC ACG GGG ACC CTC GGC AGT GTG 8707 

Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly Ser Val 
2775 2780 2785 



30 



35 



40 



50 



55 



ATC TCC GGT CGG GTT GCC TAT GCG TTG GGG TTG GAG GGG CCG GCG CTG 8755 
lie Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro Ala Leu 
2790 2795 2800 

ACG GTG GAC ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG CAT CTG GCG 8803 
Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
2805 2810 2815 

GTG CAG TCG TTG CGG CGG GGT GAG TGT GAT CTG GCG TTG GCC GGT GGG 8851 
Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
2820 2825 2830 

GTG ACG GTG ATG GCG ACG CCG ACG GTG TTC GTG GAG TTC TCG CGG CAG 8899 
Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser Arg Gin 
2835 2840 2845 2850 



CGG GGG CTG GCG GCG GAC GGG CGG TGC AAG GCG TTC GCG GAG GGT GCG 8947 
Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala 
45 2855 2860 2865 

GAC GGG ACG GCG TGG GCG GAG GGT GTG GGT GTG CTG CTG GTG GAG CGG 8995 
Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val Glu Arg 
2870 2875 2880 



CTT TCC GAC GCG CGC CGC AAC GGT CAT CGG GTG CTG GCG GTG GTG CGG 9043 
Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
2885 2890 2895 

GGC AGT GCG GTC AAT CAG GAC GGT GCG AGC AAT GGG CTG ACG GCG CCG 9091 
Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
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2900 2905 2910 

AGT GGT CCG GCG CAG CAG CGG GTG ATC CGT GAG GCG CTG GCT GAT GCG 
Ser Gly Pro Ala Gin Gin Arg Val lie Arg Glu Ala Leu Ala Asp Ala 
2915 2920 2925 2930 

GGG CTG GTG CCC GCC GAC GTG GAT GTG GTG GAG GCG CAC GGT ACG GGG 
Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
2935 2940 2945 

ACG GCG CTG GGT GAT CCG ATC GAG GCG GGT GCG CTG CTG GCC ACG TAC 
Thr Ala Leu Gly Asp Pro lie Glu Ala Gly Ala Leu Leu Ala Thr Tyr 
2950 2955 2960 

GGG CGG GAG CGG GTC GGC GAT CCG TTG TGG CTC GGG TCG TTG AAG TCG 
Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser 
2965 2970 2975 

AAC ATC GGG CAT GCG CAG GCG GCT GCG GGT GTG GGT GGT GTG ATC AAG 
Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val He Lys 
2980 2985 2990 

GTG GTG CAG GGG ATG CGG CAT GGG TCG TTG CCG CGG ACG CTG CAT GTG 
Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu His Val 
2995 3000 3005 3010 

GAT GCG CCG TCG TCG AAG GTG GAG TGG GCT TCG GGT GCG GTG GAG CTG 
Asp Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val Glu Leu 
3015 3020 3025 

CTG ACC GAG ACC CGG TCG TGG CCG CGG CGG GTG GAG CGG GTG CGG CGG 
Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val Arg Arg 
3030 3035 3040 

GCC GCG GTG TCG GCG TTC GGG GTG AGC GGG ACC AAC GCC CAT GTG GTC 
Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val Val 
3045 3050 3055 

CTG GAG GAA GCG CCG GCG GAG GCC GGG AGC GAG CAC GGG GAC GGC CCT 
Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp Gly Pro 
3060 3065 3070 

GAA CCT GAG CGG CCC GAC GCG GTG ACG GGT CCG TTG TCG TGG GTG CTT 
Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp Val Leu 
3075 3080 3085 3090 

TCT GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTG CGG TTG CGT 
Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Arg 
3095 3100 3105 

GAG TGT GTG GAG CGG GTG GGT GCG GAT CCG CGG GAT GTG GCG GGG TCG 
Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala Gly Ser 
3110 3115 3120 

TTG GTG GTG TCG CGT GCG TCG TTC GGT GAG CGT GCG GTG GTG GTC GGC 
Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val Val Gly 
3125 3130 3135 

CGG GGG CGT GAG GAG TTG CTG GCG GGT CTG GAT GTG GTG GCT GCC GGG 
Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala Ala Gly 
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3140 



3145 



3150 



10 



GCT CCT GTG GTG TCT TCG GGG GCC GGT GCT GTG GTG CGG GGG AGT 9859 

Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg Gly Ser 
3155 3160 3165 3170 

GCG GTG CGG GGT CGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG 9907 
Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala 
3175 3180 3185 

CAG TGG GTT GGT ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT 9955 
Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe 
3190 3195 3200 



GCG GAG GTG CTG GAT GAG GTG TTG TCG GTG GTG GGG GAG GTG GAT GGT 
15 Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val Asp Gly 

3205 3210 3215 



10003 



20 



CGG TCG TTG CGG GAT GTG ATG TTC GCG GAT GCT GAC TCG GTT TTG GGT 10051 
Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val Leu Gly 
3220 3225 3230 

GGG TTG TTG GGT CGG ACG GAG TTT GCT CAG CCT GCG TTG TTT GCG TTG 10099 
Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala Leu 
3235 3240 3245 3250 



25 GAG GTG GCG TTG TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTG TCG 

Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val Ser 
3255 3260 3265 



10147 



30 



GTG GTG TTG GGT CAT TCG GTG GGG GAG GTG GCT GCT GCG TAT GTG GCG 
Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val Ala 
3270 3275 3280 



10195 



35 



GGG GTG TTG TCG TTG GGT GAT GCG GTG CGG TTG GTG GTG GCG CGG GGT 10243 
Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg Gly 
3285 3290 3295 

GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG GGG 10291 
Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val Gly 
3300 3305 3310 



40 



GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG TGG 
Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu Trp 
3315 3320 3325 3330 



10339 



45 



GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG GGT 
Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser Gly 
3335 3340 3345 



10387 



50 



GAT GTG GGT GTG CTG GAG TCG GTG GTT GTC ACG CTG ATG GGG GAT GGG 10435 
Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly Asp Gly 
3350 3355 3360 

GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTG TTG 10483 
Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val Leu 
3365 3370 3375 



55 



ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG GAG 
Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu Glu 
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3380 



3385 



3390 
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55 



TTC GGT CGG GTG CGG CCG GGT GTG GTG GTG GTG TCG GGT GTG TCG GGT 10579 
Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser Gly 
3395 3400 3405 3410 

GGG GTG GTG GGT TCG GGG GAG TTG GGG GAT CCG GGG TAT TCG GTG CGT 10627 
Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val Arg 
3415 3420 M 3425 

CAT GCG CGG GAG GCG GTG CGT TTC GCG GAT GGG GTG GGG GTG GTG CGT 10675 
His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val Val Arg 
3430 3435 3440 

GGT CTG GGT GTG GGG ACG TTG GTG GAG GTG GGT CCG CAT GGG GTG CTG 10723 
Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly Val Leu 
3445 3450 3455 

ACG GGG ATG GCG GGT CAG TGC CTG GAG GCC GGT GAT GAT GTG GTG GTG 10771 
Thr Gly Met Ala Gly Gin Cys Leu Glu Ala Gly Asp Asp Val Val Val 
3460 3465 3470 

GTG CCG GCG ATG CGG CGG GGC CGT CCG GAG CGG GAG GTG TTC GAG GCG 10819 
Val Pro Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe Glu Ala 
3475 3480 3485 3490 

GCG CTG GCG ACG GTG TTC ACC CGG GAC GCC GGC CTC GAC GCC ACG ACA 10867 
Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr Thr 
3495 3500 3505 

CTC CAC ACC GGG AGC ACC GGC CGA CGC ATC GAC CTC CCC ACC TAC CCC 10915 
Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro Thr Tyr Pro 
3510 3515 3520 

TTC CAA CAC AAC CGC TAC TGG GCA ACC GGC TCA GTG ACC GGT GCG ACC 10963 
Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly Ala Thr 
3525 3530 3535 

GGC ACC TCG GCA GCC GCG CGC TTC GGC CTG GAG TGG AAG GAC CAC CCC 11011 
Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp His Pro 
3540 3545 3550 

TTC CTC AGC GGC GCC ACG CCG ATA GCC GGC TCC GGC GCG CTG CTC CTC 11059 
Phe Leu Ser Gly Ala Thr Pro He Ala Gly Ser Gly Ala Leu Leu Leu 
3555 3560 3565 3570 

ACC GGC AGG GTG GGG CTC GCT GCC CAC CCG TGG CTG GCC GAC CAC GCC 11107 
Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp His Ala 
3575 3580 3585 

ATC TCC GGC ACG GTG CTG CTC CCC GGA ACG GCG ATC GCC GAC CTG CTG 11155 
He Ser Gly Thr Val Leu Leu Pro Gly Thr Ala He Ala Asp Leu Leu 
3590 3595 3500 

CTG CGG GCG GTC GAG GAG GTC GGC GCC GGA GGG GTC GAG GAA CTG ACG 11203 
Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu Leu Thr 
3605 3610 3615 

CTC CAT GAG CCC CTG CTC CTC CCC GAG CGA GGC GGC CTG CAC GTC CAG 11251 
Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His Val Gin 
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3620 



3625 



3630 



GTC CTG GTC GAG GCG GCC GAC GAG CAG GGA CGG CGT GCC GTG GCA GTC 
Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val Ala Val 
3635 3640 3645 3650 



10 



11299 



GCC GCA CGC CCG GAG GGC CCT GGG CGG GAC GGT GAG GAA CAG GAG TGG 11347 
Ala Ala Arg Pro Glu Gly Pro Gly Arg Asp Gly Glu Glu Gin Glu Trp 
3655 3660 3665 

ACC CGG CAC GCG GAA GGC GTG CTC ACC TCC ACC GAG ACG GCC GTT CCG 11395 
Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala Val Pro 
3670 3675 3680 



75 



GAC ATG GGC TGG GCC GCC GGG GCC TGG CCG CCG CCC GGT GCC GAG CCG 
Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala Glu Pro 
3685 3690 3695 



20 



11443 



ATC GAC GTC GAG GAG CTG TAC GAC GCG TTC GCC GCG GAC GGC TAC GGC 11491 
He Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly Tyr Gly 
3700 3705 3710 

TAC GGC CCG GCC TTC ACC GCA CTG TCC GGC GTG TGG CGT CTC GGC GAC 11539 
Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu Gly Asp 
3715 3720 3725 3730 



25 



GAA CTC TTC GCC GAG GTG CGG CGG CCC GCG GGG GGC GCG GGC ACG ACC 
Glu Leu Phe Ala Glu Val Arg Arg Pro Ala Gly Gly Ala Gly Thr Thr 
3735 3740 3745 



11587 



30 



GGT GAC GGT TTC GGC GTC CAC CCC GCA CTC TTC GAT GCG GCC CTC CAC 
Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala Leu His 
3750 3755 3760 



11635 



35 



CCG TGG CGC GCC GGC GGG CTG CTG CCC GAC ACG GGC GGC ACC ACC TGG 11683 
Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr Thr Trp 
3765 3770 3775 

GCG CCG TTC TCC TGG CAG GGC ATC GCG CTC CAC ACC ACC GGA GCC GAG 11731 
Ala Pro Phe Ser Trp Gin Gly He Ala Leu His Thr Thr Gly Ala Glu 
3780 3785 3790 



40 



ACG CTC CGC GTC AGA CTG GCC CCT GCG GCC GGC GGC ACC GAG TCG GCC 
Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu Ser Ala 
3795 3800 3805 3810 



11779 



45 



TTC TCC GTA CAG GCC GCC GAC CCG GCG GGC ACC CCG GTC CTC ACC CTC 
Phe Ser Val Gin Ala Ala Asp Pro Ala Gly Thr Pro Val Leu Thr Leu 
3815 3820 3825 



11827 



50 



GAC GCA CTG CTG CTC CGC CCG GTG ACC CTG GGG AGG GCC GAC GCG CCG 11875 
Asp Ala Leu Leu Leu Arg Pro Val Thr Leu Gly Arg Ala Asp Ala Pro 
3830 3835 3840 

CAA CCG CTG TAC CGC GTC GAC TGG CAG CCG GTC GGC CAG GGG ACC GAG 11923 
Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly Thr Glu 
3845 3850 3855 



55 



GCC TCC GGC GCC CAG GGC TGG ACG GTG CTC GGG CAG GCC GCG GCC GAG 
Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala Ala Glu 



11971 



194 



EP 0 791 655 A2 



3860 3865 3870 

ACG GTC GCG CAG CCC GCC GCC CAT GCG GAC CTC ACC GCC CTG CGT ACG 
Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu Arg Thr 
3875 3880 3885 3890 

GCT GTG GCC GCG GCG GGA ACA CCC GTG CCC CGG CTG GTG GTC GTG TCG 
Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val Val Ser 
3895 3900 3905 

CCG GTG GAC ACC CGG CTG GAC GAG GGG CCG GTG CTG GCG GAC GCC GAG 
Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp Ala Glu 
3910 3915 3920 

GCT CGG GCC CGT GCG GGT GAC GGC TGG GAC GAC GAT CCC CTA CGT GTC 
Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu Arg Val 
3925 3930 3935 

GCC CTC GGG CGC GGC CTG ACC CTG GTC CGG GAG TGG GTC GAG GAC GAA 
Ala Leu Gly Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu Asp Glu 
3940 3945 3950 

CGG TIG GCG GAC TCC CGG CTC GTC GTC CTC ACC CGT GGC GCG GTG GCG 
Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala Val Ala 
3^55 3960 3965 " 3970 

GCC GGT CCC GGC GAT GTG CCG GAC CTG ACA GGT GCG GCC CTG TGG GGG 
Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu Trp Gly 
3975 3980 3985 

CTG CTC CGC TCC GCG CAG TCG GAG TAT CCG GAC CGC TTC ACC CTC ATC 
Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr Leu He 
3990 3995 4000 

GAC GTG GAC GAT TCC CCC GAG TCC CGT GCG GCT CTG CCC CGG GCT CTG 
Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg Ala Leu 
4005 4010 4015 

GGA TCG GCC GAG CGA CAA CTC GCC CTG CGG ACG GGC GAC GTG CTG GCG 
Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val Leu Ala 
4020 4025 4030 

CCG GCC CTG GTC CCG ATG GCC ACC CGG CCG GCG GAG ACC ACT CCA GCG 
Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr Pro Ala 
4035 4040 4045 4050 

ACG GCG GTC GCC TCG GCG ACA ACA CAG ACA CAG GTC ACC GCG CCC GCT 
Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala Pro Ala 
4055 4060 4065 

CCC GAC GAC CCG GCT GCG GAT GCC GTG TTC GAC CCG GCG GGC ACC GTA 
Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly Thr Val 
4070 4075 4080 

CTG ATC ACC GGC GGC ACC GGC GCC CTG GGA CGG CGT GTC GCC TCG CAC 
Leu He Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala Ser His 
4085 4090 4095 

CTC GCG CGC CGG TAC GGC GTA CGC CAC ATG CTT CTG GTC AGC AGG CGT 
Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser Arg Arg 
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4100 



4105 



4110 



GGA CCG GAC GCC CCC GAG GCC GGT CCC CTG GAA CGG GAA CTC GCC GGT 
Gly Pro Asp Ala Pro Glu Ala Gly Pro Leu Glu Arg Glu Leu Ala Gly 
4115 4120 4125 4130 



12739 



10 



CTC GGA GTC ACC GCC ACC TTC CTG GCA TGC GAC CTC ACC GAC ATC GAG 12787 

Leu Gly Val Thr Ala Thr Phe Leu Ala Cys Asp Leu Thr Asp lie Glu 
4135 4140 4145 

GCC GTA CGG AAG GCC GTC GCC GCG GTG CCG TCG GAC CAC CCG CTG ACC 1283 5 

Ala Val Arg Lys Ala Val Ala Ala Val Pro Ser Asp His Pro Leu Thr 

4150 4155 4160 



75 



GGT GTG GTG CAC ACC GCC GGC GTG CTG GAC GAC GGC GCC CTG ACC GGC 
Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu Thr Gly 
4165 4170 4175 



12883 



20 



CTG ACC CGG CAA CGC CTC GAC ACC GTG CTG CGG CCC AAG GCC GAC GCC 12931 
Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala Asp Ala 
4180 4185 4190 

GTG CGG AAC CTC CAC GAG GCG ACC CTC GAC CGG CCG CTG CGC GCG TTC 12979 
Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg Ala Phe 
4195 4200 4205 4210 



25 GTC CTG TTC TCC GCC GCC GCC GGA CTC CTG GGC CGC CCC GGG CAG GCC 

Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Gly Gin Ala 
4215 4220 4225 



13027 



TCC TAC GCC GCC GCC AAC GCG GTC CTC GAC GCG CTC GCG GGA GCC CGC 
3Q Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Gly Ala Arg 

4230 4235 4240 



13075 



35 



CGC GCG GCC GGA CTG CCC GCA GTG TCC CTG GCG TGG GGC CTG TGG GAC 13123 
Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Asp 
4245 4250 4255 

GAG CAG ACG GGC ATG GCA GGA GGC CTC GAC GAG ATG GCC CTG CGC GTG 13171 
Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu Arg Val 
4260 4265 4270 



40 



CTG CGC CGG GAC GGC ATC GCC GCG ATG CCT CCG GAG CAG GGG CTC GAA 
Leu Arg Arg Asp Gly lie Ala Ala Met Pro Pro Glu Gin Gly Leu Glu 
4275 4280 4285 4290 



13219 



45 



CTG CTC GAC CTG GCC CTG ACC GGA CAC CGG GAC GGA CCC GCC GTC CTC 
Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala Val Leu 
4295 4300 4305 



13267 



50 



GTC CCC CTC CTC CTC GAC GGC GCG GCC CTG CGC CGC ACG GCG AAG GAG 13315 
Val Pro Leu Leu Leu Asp Gly Ala Ala Leu Arg Arg Thr Ala Lys Glu 
4310 4315 " 4320 

CGC GGC GCG GCC ACG ATG TCC CCC TTG CTG CGC GCC CTG CTG CCC GCC 13363 
Arg Gly Ala Ala Thr Met Ser Pro Leu Leu Arg Ala Leu Leu Pro Ala 
4325 4330 4335 



55 



GCC CTG CGC CGC AGC GGT GGA GCC GGC GCC CCC GCG GCG GCC GAC CGG 
Ala Leu Arg Arg Ser Gly Gly Ala Gly Ala Pro Ala Ala Ala Asp Arg 
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4340 



4345 



4350 



10 



CAC GGC AAG GAG GCG GAC CCC GOT GCG GGA CGC CTC GCA GGG ATG GTG 
His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly Met Val 
4355 4360 4365 4370 



13459 



GCA CTC GAA GCG GCG GAG CGT TCC GCG GCC GTC CTT GAG CTG GTC ACC 13507 
Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu Val Thr 
4375 4380 4385 

GAA CAG GTC GCC GAG GTC CTC GGC TAC GCG TCG GCC GCG GAG ATC GAG 13555 
Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu lie Glu 
4390 4395 4400 



75 



20 



CCC GAA CGA CCC TTC CGG GAG ATC GGC GTC GAC TCC CTG GCG GCG GTG 13603 
Pro Glu Arg Pro Phe Arg Glu He Gly Val Asp Ser Leu Ala Ala Val 
4405 4410 4415 

GAG CTG CGC AAC CGG CTC AGC CGT CTG GTC GGC CTG CGG TTG CCG ACC 13651 
Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu Pro Thr 
4420 4425 4430 



25 



30 



35 



40 



45 



ACG CTG TCC TTC GAC CAC CCC ACG CCG AAG GAC ATG GCG CAG CAC ATC 13699 
Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin His He 
4435 4440 4445 4450 

GAC GGG CAG CTC CCC CGC CCG GCC GGA GCC TCG CCC GCG GAC GCA GCG 13747 
Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp Ala Ala 
4455 4460 4465 

CTG GAA GGG ATC GGC GAC CTC GCG CGG GCG GTC GCC CTG CTG GGC ACG 13795 
Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu Gly Thr 
4470 4475 4480 

GGC GAC GCC CGC CGG GCC GAG GTA CGA GAG CAG CTC GTC GGA CTG CTG 13843 
Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin Leu Val Gly Leu Leu 
4485 4490 4495 

GCC GCG CTC GAC CCA CCT GGG CGG ACG GGC ACC GCC GCA CCC GGC GTC 13891 
Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro Gly Val 
4500 4505 4510 

CCC TCC GGT GCC GAT GGC GCG GAA CCG ACC GTG ACG GAC CGG CTC GAC 13939 
Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg Leu Asp 
4515 4520 4525 4530 

GAG GCG ACC GAC GAC GAG ATC TTC GCC TTC CTG GAC GAG CAG CTG TGA 13987 
Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin Leu * 
4535 4540 4545 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4546 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ser Gly Glu Leu Ala He Ser Arg Ser Asp Asp Arg Ser Asp Ala 
15 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg 
35 40 45 

Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe 
50 55 60 

Asp Ala Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp 
15 65 70 75 80 

Pro Gin His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 



10 



20 



35 



Ala Gly He Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe 
100 105 110 

Ala Gly Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala 
115 120 125 

25 Val Ser Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala 

130 135 140 

Ala Asn Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val 
145 150 155 160 

30 

Val Asp Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys 
165 170 175 

Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu He Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly 
195 200 205 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 215 220 

Gly Tyr Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu 
225 230 235 240 

Asp Ala Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly 
45 245 250 255 

Gly Ala Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp 
260 265 270 

so teg Glu Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly 

275 280 285 

Val Ser Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr 
290 295 300 



40 



55 



Arg Ala Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly 
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305 



310 



315 



320 



Ala Gly Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val 
325 330 335 

Lys Thr Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu 
340 345 350 



10 



He Lys Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu 
355 360 365 



75 



Asn Phe Ser Thr Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu 
370 375 380 

Arg Val Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro 
385 390 395 400 

Arg Val Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His 
405 410 415 



20 



25 



30 



35 



40 



45 



50 
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Leu Val He Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly 
420 425 430 

Gly Ser Gly Ala Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val 
435 440 445 

Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala 
450 455 460 

Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala 
465 470 475 480 

Asp Val Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala 
485 490 495 

Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala 
500 505 510 

Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu 
515 520 525 

Asp Gly Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala 
530 535 540 

Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu 
545 550 555 560 

Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu 
565 570 " 575 

Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg 
580 585 ~ 590 

Ala Leu Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly 
595 600 605 

Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala 
610 615 620 

Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val 
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625 



630 



635 



640 



Ala Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val 
645 650 655 

Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg 
660 665 670 

Ala Gly Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala 
675 680 685 

Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro 
690 695 700 

Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser 
705 710 715 720 

Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu 
725 730 735 

Gly Val Gin Ala Arg Leu lie Pro Val Asp Tyr Ala Ser His Ser Arg 
740 745 750 

His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly lie 
755 760 765 

Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu 
770 775 780 

Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu 
785 790 795 800 

Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu 
805 810 815 

Gly His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His 
820 825 830 

Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr 
835 840 845 

Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser 
850 855 860 

Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala 
865 870 875 880 

Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His 
885 890 895 

His His Tyr Trp Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala 
900 905 910 

Thr Gin Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu 
915 920 925 

Pro Ser Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg 
930 935 940 



His Thr Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro 
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945 950 955 



960 



Asp Leu Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu 
965 970 975 

Leu Arg Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr 
980 985 990 

Leu Val Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His 
995 1000 1005 

Thr Glu Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala 
1010 1015 1020 

Gin Lys Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala 
1025 1030 1035 1040 

Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro 
1045 1050 * 1055 

He Ala He Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Asp Ser 
1060 1065 1070 

Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala He Ser 
1075 1080 1085 

Ser Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro 
1090 1095 noo 

Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu 
H05 mo 1115 n2o 

His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg 
1125 mo " H35 

Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser 
H40 H45 H50 

Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly 
II 55 1160 U65 

Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro 
1170 H75 H80 

Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly 
1185 H90 1195 1200 

Thr Thr Ala Ser Val Ala Ser Gly Arg He Ala Tyr Thr Leu Gly Thr 
1205 1210 1215 

Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val 
12 20 1225 ~ 1230 

Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg Gly Glu Cys Gly Leu 
1235 1240 1245 

Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly Pro Gly Met Phe Val 
1250 1255 126O 

Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Met Pro 
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1265 1270 1275 1280 

Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser Glu Gly Val Ala Val 
1285 1290 1295 

Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly His Arg Val 
1300 1305 1310 

Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn 
1315 1320 1325 

Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu Gly Val He Arg Ala 
1330 1335 1340 

Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp Val Asp Ala Val Glu 
15 1345 1350 1355 1360 

Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Ser Ala 
1365 1370 1375 



10 



20 



35 



Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu 
1380 1385 1390 

Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin Ala Ala Ala Gly Ala 
1395 1400 1405 

25 Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu His Gly Thr Leu Pro 

1410 1415 1420 

Arg Thr Leu His Ala Asp Arg Pro Ser Thr His Val Asp Trp Ser Ser 
1425 1430 1435 1440 

30 

Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg Trp Pro Arg Arg Ser 
1445 1450 1455 

Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly He Ser Gly Thr 
1460 1465 1470 

Asn Ala His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He 
1475 1480 1485 

Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro 
1490 1495 1500 

Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin 
1505 1510 1515 1520 

Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg 
45 1525 1530 1535 

Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg 
1540 1545 1550 

so Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly 

1555 1560 1565 

Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala 
1570 1575 1580 



40 



55 



Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp 
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1585 1590 1595 



1600 



Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu 
1605 1610 1615 

Val Leu Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser 
1620 1625 1630 

Leu Arg Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly 
1635 1640 1645 

Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly 
1650 1655 1660 

Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu 
1665 1670 1675 1680 

Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val 
1685 1690 1695 

Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala 
1700 1705 1710 

Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val 
1715 1720 1725 

Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp 
1730 1735 1740 

Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu 
1745 1750 1755 1760 

Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val 
1765 1770 1775 

Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met 
1780 1785 1790 

Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His 
1795 1800 1805 

Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu 
181° 1815 1820 

Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Glv 
1825 1830 1835 i 8 4o 

Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr 
1845 1850 ^ 1855 

Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly 
I860 1865 1870 

Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His 
!875 1880 1885 

Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp 
1890 1895 1900 

Val Val Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val 
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1905 1910 1915 1920 

Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp 
1925 1930 1935 

Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro 
1940 1945 1950 

Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr 
1955 1960 * 1965 

Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala 
1970 1975 1980 

Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly He Arg Tyr Arg Val Ala 
1985 1990 1995 2000 

Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His 
2005 2010 2015 

Val Leu Leu Leu Ala Pro Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala 
2020 2025 2030 

Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr 
2035 2040 2045 

Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu 
2050 2055 2060 

Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu 
2065 2070 2075 2080 

Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala 
2085 2090 2095 

Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu 
2100 2105 2110 

Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin 
2115 2120 2125 

Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp 
2130 2135 2140 

Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg 
2145 2150 2155 2160 

Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val 
2165 2170 2175 

Arg Pro Ser Gly Val His Val Arg Arg Val Val Pro Ala Pro Val Pro 
2180 2185 2190 

Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val 
2195 2200 2205 

Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp 
2210 2215 2220 

Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
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2225 2230 2235 2240 

Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Axg Leu 
2245 2250 2255 

Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu 
2260 2265 2270 

Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys 
2275 2280 2285 

Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro 
2290 2295 2300 

Glu Glu Arg Pro Leu Val Ala Val Leu His Ala Ala Gly Val Leu Asp 
2305 2310 2315 2320 

Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu 
2325 2330 2335 

Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp 
2340 2345 2350 

Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser He Val Gly Val Trp 
2355 2360 2365 

Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2370 2375 2380 

Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser He 
2385 2390 2395 2400 

Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala 
2405 2410 2415 

Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala 
2420 2425 2430 

Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly Gly Thr Ser Ala Ala 
2435 2440 2445 

Gly ^L His Ala Ala Gly Glu Ser Ser Leu Leu Va l Ala Asp Val Asp 
2450 2455 2460 

Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg Arg Thr Trp Ser Leu 
2465 2470 2475 2480 

Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg Ala Gly His Ala Ala 
2485 2490 2495 

Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg Pro Gly Asp Gly Ala 
2500 2505 2510 

Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly Arg Pro Trp Leu Ser 
2515 2520 2525 

Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala Leu Leu Thr Leu Val 
25 30 2535 2 540 

Arg Ser Glu Ala Ala Gly He Leu Arg His Ala Ser Ala Asp Ala Val 
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2545 2550 2555 2560 

Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe Asp Ser Leu Thr Val 
2565 2570 2575 

Leu Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr Gly Leu Asn Leu Pro 
2580 2585 2590 

Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ser Leu Ala Ser His 
2595 2600 2605 

Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu Ala Glu Pro Ala Ala 
2610 2615 2620 

Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg Glu Pro lie Ala lie 
2625 2630 2635 2640 

Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Ser Pro Asp Asp 
2645 2650 2655 

Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr Leu Ser Pro Phe Pro 
2660 2665 2670 

Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly 
2675 2680 2685 

Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala 
2690 2695 2700 

Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Thr 
2705 2710 2715 ~ 2720 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala 
2725 2730 2735 

Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu Arg Gly Thr Arg Thr 
2740 2745 2750 

Gly Val Phe Ser Gly He Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly 
2755 2760 2765 

Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly 
2770 2775 2780 

Ser Val He Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro 
2785 2790 2795 2800 

Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His 
2805 2810 2815 

Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala 
2820 2825 2830 

Gly Gly Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser 
2835 2840 2845 

Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu 
2850 2855 2860 

Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val 
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2865 2870 2875 2880 

Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val 
5 2885 2890 2895 

Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr 
2900 2905 2910 

Ala Pro Ser Gly Pro Ala Gin Gin Arg Val He Arg Glu Ala Leu Ala 
10 2915 2920 2925 

Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly 
2930 2935 2940 

1K Thr G1 y Ala Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala 

2945 2950 2955 2960 

Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu 
2965 2970 2975 

20 L ys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val 

2980 2985 2990 

He Lys Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu 
2995 3000 3005 

25 His Val As P Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val 

3010 3015 3020 

Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val 
3025 3030 3035 3040 

Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His 
3045 3050 3055 

Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp 
3060 3065 3070 

Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp 
3075 3080 3085 

Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg 
40 3090 3095 3100 

Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala 
3105 3110 3115 3120 



30 



35 



45 



50 



55 



Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val 
3125 3130 " 3135 

Val Gly Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala 
3140 3145 3150 

Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg 
3155 3160 3165 

Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin 
3170 3175 3180 

Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly 



207 



EP 0 791 655 A2 

3185 3190 3195 3200 

Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val 
3205 3210 3215 

Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val 
3220 3225 3230 

Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe 
3235 3240 3245 

Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu 
3250 3255 3260 

Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr 
3265 3270 3275 3280 

Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala 
3285 3290 3295 

Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser 
3300 3305 3310 

Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly 
3315 3320 3325 

Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu 
3330 3335 3340 

Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly 
3345 3350 3355 3360 

Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser 
3365 3370 3375 

Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser 
3380 3385 3390 

Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val 
3395 3400 3405 

Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp 
3410 3415 3420 

Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val 
3425 3430 3435 3440 

Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly 
3445 3450 3455 

Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu Ala Gly Asp Asp Val 
3460 3465 3470 

Val Val Val Pro Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe 
3475 3480 3485 

Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala 
3490 3495 3500 

Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro Thr 
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3505 3510 3515 3520 

Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly 
3525 3530 3535 

Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp 
3540 3545 ~ 355 o 

His Pro Phe Leu Ser Gly Ala Thr Pro He Ala Gly Ser Gly Ala Leu 
3555 35 6 o 3565 

Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp 
3570 3575 3 5 8 o 

His Ala He Ser Gly Thr Val Leu Leu Pro Gly Thr Ala lie Ala Asp 
3585 3590 3595 3600 

Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu 
3605 36io 3615 

Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His 
3«0 3625 3630 

Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val 
363 5 3640 3645 

A1S T«„ Ala Ala *** Pr ° Glu Gly Pro G1 y ^9 As P Glu Glu Gin 

3650 3655 3660 

Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala 
3665 3670 3675 3 680 

Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala 
3685 3690 3595 

Glu Pro lie Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly 
3700 3705 3 710 

Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu 
3715 3720 3725 

A 7? ft G1U PhS A1S G1U Val Pro Ala G ly Gly Ala Gly 

3730 . 3735 3740 

Thr_Thr Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala 

3750 3755 3 760 

Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr 
37 65 3770 3775 

Thr Trp Ala Pro Phe Ser Trp Gin Gly lie Ala Leu His Thr Thr Gly 
3780 3785 3790 

Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu 
3795 3800 3805 

A « a n Phe Val Gln Ala AU **» Pro Ala Gly Thr Pro Val Leu 
3810 3815 3820 

Thr Leu Asp Ala Leu Leu Leu Arg Pro Val Thr Leu Gly Arg Ala Asp 
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3825 3830 3835 3840 

Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly 
3845 3850 3855 

Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala 
3860 3865 3870 

Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu 
3875 3880 3885 

Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val 
3890 3895 3900 

Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp 
3905 3910 3915 3920 

Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu 
3925 3930 * 3935 

Arg Val Ala Leu Gly Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu 
3940 3945 3950 

Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala 
3955 3960 3965 

Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu 
3970 3975 3980 

Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr 
3985 3990 3995 4000 

Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg 
4005 4010 4015 

Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val 
4020 4025 4030 

Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr 
4035 4040 4045 

Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala 
4050 4055 4060 

Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly 
4065 4070 4075 4080 

Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala 
4085 4090 " 4095 

Ser His Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser 
4100 4105 4110 

Arg Arg Gly Pro Asp Ala Pro Glu Ala Gly Pro Leu Glu Arg Glu Leu 
4115 4120 4125 

Ala Gly Leu Gly Val Thr Ala Thr Phe Leu Ala Cys Asp Leu Thr Asp 
4130 4135 4140 

He Glu Ala Val Arg Lys Ala Val Ala Ala Val Pro Ser Asp His Pro 
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4145 4150 4155 



4160 



Leu Thr Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu 
4165 4170 4175 

Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala 
4180 4185 4190 

Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg 
4195 4200 4205 

Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Glv 
4 210 4215 4220 

Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Glv 
4225 4230 4235 4240 

Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu 
4245 4250 4255 

Trp Asp Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu 
4260 4265 4270 

Arg Val Leu Arg Arg Asp Gly He Ala Ala Met Pro Pro Glu Gin Glv 
4275 4280 42 85 

Leu Glu Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala 
4290 4295 4300 
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Val Leu Val Pro Leu Leu Leu Asp 
4305 4310 

Lys Glu Arg Gly Ala Ala Thr Met 
4325 

Pro Ala Ala Leu Arg Arg Ser Gly 
4340 



Gly Ala Ala Leu Arg Arg Thr Ala 
4315 4320 

Ser Pro Leu Leu Arg Ala Leu Leu 
4330 4335 

Gly Ala Gly Ala Pro Ala Ala Ala 
4345 4350 



Asp Arg His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly 
4355 4360 4365 

Met Val Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu 
4370 4375 4380 



Val Thr Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu 
4385 4390 4395 4400 

lie Glu Pro Glu Arg Pro Phe Arg Glu lie Gly Val Asp Ser Leu Ala 
4405 4410 4415 

Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu 
4420 4425 4430 

Pro Thr Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin 
4435 4440 4445 

His He Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp 
4450 4455 4460 

Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu 
4465 4470 4475 4480 

Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin hen Val Gly 
4485 4490 4495 

Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro 
4500 4505 4510 

Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg 
4515 4520 4525 

Leu Asp Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin 
4530 4535 4540 

Leu * 
4545 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGAGGCCGGC GGGCC 



Claims 

1. A DNA molecule comprising an isolated DNA sequence that encodes a tylactone synthase domain. 

2. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 942 to 2156, 2571 to 3557, 3675 to 3929, 3993 to 5264, 5631 to 6617, 7410 to 7949, 8220 to 
B471, 8541 to 9812, 10260 to 11246, 11319 to 11876, 12861 to 13415, 13719 to 13970, 14411 to 15697, 16055 
to 17122, 17198 to 17794, 18584 to 19138, 19415 to 19666, 20136 to 21404, 21771 to 22757, 23541 to 24077, 
24360 to 24611, 24675 to 25949, 26292 to 27284, 27360 to 27917, 28767 to 29813, 29829 to 30368, 30651 to 
30902. 31337 to 32608, 32975 to 33961, 34694 to 35236, 35492 to 35743, 36360 to 37631, 37989 to 38987 
39759 to 4031 3, 40575 to 40826, and 41 235 to 41 333 all in SEQ ID NO:1 . 

3. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 942 to 8471 , 8541 to 1 3970, 1 441 1 to 1 9666, 201 36 to 2461 1 , 24675 to 30902, 31 337 to 35743 
and 36360 to 40826 all in SEQ ID NO:1. 

4. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 81 6 to 14234, 14351 to 19945, 20010to 311 99, 31232 to 36067, and 36249 to41774all in SEQ 
ID NO:1. 

5. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:1 . 

6. A polypeptide comprising an amino acid sequence that consists of a tylactone synthase domain. 

7. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 43 to 447, 586 to 91 4, 954 to 1 038, 1 060 to 1 483, 1 606 to 1 934, 21 99 to 2378, 2469 to 2552 
2576 to 2999, 3149 to 3477, 3502 to 3687, 4016 to 4200, and 4302 to 4385 in SEQ ID NO:2; ' 

(b) amino acids 21 to 449, 569 to 924, 950 to 1148, 1412 to 1596, and 1689 to 1772 in SEQ ID NO:3; 

(c) amino acids 43 to 465, 588 to 91 6, 1 1 78 to 1 356, and 1 451 to 1 534, 1 556 to 1 980, 2095 to 2425 2451 to 
2636, 3274 to 3453, and 3548 to 3631 in SEQ ID NO:4; 

(d) amino acids 36 to 459, 582 to 910, 1155 to 1335, and 1421 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 461, 581 to 913, 1171 to 1355, 1443 to 1526, and 1663 to 1695 in SEQ ID NO:6. 

8. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 1060 to 2552 and 2576 to 4385 in SEQ ID NO:2; 

(b) amino acids 21 to 1772 in SEQ ID NO:3; 
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(c) amino acids 43 to 1534 and 1556 to 3631 in SEQ ID NO:4; 

(d) amino acids 36 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 1526 in SEQ ID NO:6. 

s 9. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of SEQ I D NO: 
2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

10. A recombinant DNA vector comprising a DNA molecule of Claim 1 . 

10 11. A recombinant DNA vector comprising a DNA molecule of Claim 2. 

12. A recombinant DNA vector comprising a DNA molecule of Claim 3. 

13. A recombinant DNA vector comprising a DNA molecule of Claim 4. 

15 

14. A recombinant DNA vector comprising a DNA molecule of Claim 5. 

15. A recombinant DNA vector of Claim 1 0 which is NRRL B-18688. 
20 16. A recombinant DNA vector of Claim 10 which is NRRL B-18689 

17. A host cell transformed with a recombinant DNA vector of Claim 10. 

18. A host cell transformed with a recombinant DNA vector of Claim 11 . 

25 

19. A host cell transformed with a recombinant DNA vector of Claim 12. 

20. A host cell transformed with a recombinant DNA vector of Claim 13. 
30 21. A host cell transformed with a recombinant DNA vector of Claim 14. 

22. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:7. 

23. The polypeptide of SEQ ID NO:8. 

35 
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